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EVALUATION 


This  report  is  the  Final  Report  on  the  contract.  It  covers 
research  done  on  CCD  corner-turning  memories  during  the  25  1/2  month 
period  15  September  1977  to  31  October  1979.  The  objective  of  the 
research  is  the  development  of  2-dimensional  CCD  corner-turning 
memories  with  integral  high-speed  demultiplexer/multiplexers  to 
provide  analog  data  reformatting  with  wide  dynamic  range,  for  such 
purposes  as  radar  doppler  processors,  matched  filtering  of  low  data 
rate  covert  spread  spectrum  communication,  and  high  resolution  spectral 
analysis  of  long  time  records.  Such  devices  were  designed,  fabricated, 
and  demonstrated,  and  working  samples  were  delivered  with  test  fixtures 
The  above  work  is  of  value  since  it  provides  a  key  device  needed 
for  accomplishing  signal  processing  in  USAF  command,  control, 
communication,  and  intelligence  systems.  Related  TPO  is  R5D. 

l/ukd  ^ 

VIRGIL  E.  VICKERS 
Project  Engineer 


SECTION  I 
INTRODUCTION 


Advancements  in  charge  coupled  device  technology  have  resulted  in  the 
addition  of  a  number  of  new  components  for  analog  signal  processing 
applications.  The  most  promising  developments  in  this  field  are  the 
integrated  chirp  Z-transform  (CZT)^  and  the  digitaj-analog  correlator, 2  both 
of  which  capitalize  on  the  tremendous  effective  computational  power  of 
analog  transversal  filters  to  accomplish  complex  signal  processing  functions 
with  modest  volume,  weight,  and  power  dissipation.  These  devices  can  be 
thought  of  as  linear  operators  that  transform  a  fixed  length  input  record  V 
into  an  output  record  U  of  identical  length  according  to  U  *  LV.  In  the 
case  of  the  chirp  Z-transform  L  corresponds  to  the  discrete  Fourier  transform 
operator  via  the  chirp  Z-algorithm,  while  in  the  digital-analog  correlator,  L 
corresponds  to  a  programmable  scalar  product  operator. 

Unfortunately,  in  a  number  of  potentially  interesting  applications  the 
input  data  sequence  does  not  occur  in  the  form  of  consecutive,  fixed  length 
input  records.  A  particularly  illustrative  example  is  the  case  in  which  it  is 
desired  to  utilize  the  CZT  processor  to  accomplish  doppler  resolution  of  the 
returns  in  a  pulsed  radar  system.  The  situation  is  illustrated  in  Figure 
1(a),  which  shows  the  radar  video  in  each  transmitter  pulse  repetition 
interval  ( PR  I )  subdivided  into  range  bins  that  are  dictated  by  the  range 
resolution  of  the  radar  circuitry.  To  accomplish  the  doppler  processing, 
the  returns  in  each  range  bin  must  be  analyzed  over  a  number  of  PRIs 
dictated  by  the  number  of  points  in  the  transform.  Use  of  an  integrated  CZT 
processor  in  this  application  requires  that  the  input  record  be  sequential 
in  PRIs,  whereas  the  radar  video  is  sequential  in  range  bins.  Figures  1(b) 
through  1(e)  illustrate  the  problem. 

The  radar  video  shown  in  Figure  1(b)  is  low  pass  filtered  and  sampled 
at  a  rate  commensurate  with  the  range  resolution  resulting  in  Figure  1(c). 
For  illustrative  purposes  a  constant  amplitude  return  (stationary  target)  is 

1 


2 


T 


\sK 


A> - 

b)  RAW  VIDEO 


c)  FILTERED  AND  SAMPLED  VIDEO 


+ 


Rang*  bin 
1  2  3  4  5  6  7  8 

i  i  t  I  i  i  I  I  I 

_n_n _  i 

_  2 

PRI 

3 

-H-u -  « 

d)  VIDEO  IN  MATRIX  FORMAT 


Figure  I  Video  Reformatting  in  Radar  Doppler  Processor 
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shown  in  range  bin  1,  and  a  time  varying  return  (moving  target)  is  shown  in 
range  bin  3.  The  serial  video  consists  of  four  records,  each  of  length  8, 
which  are  grouped  i n  a  4  x  8  matrix  format  in  Figure  1(d).  The  rows 
correspond  to  pulse  intervals  and  columns  to  range  bins  within  each  interval. 
If  we  consider  Figure  1(d)  as  representing  the  contents  of  a  **  x  8  memory 
array  into  which  the  sampled  video  was  loaded  on  a  row-by-row  basis,  then  it 
can  be  seen  that  the  desired  reformatting  operation  consists  in  reading  the 
contents  of  the  memory  column-by-column.  This  results  in  eight  records,  each 
four  samples  in  length  and  each  sequential  in  pulse  interval,  as  required  for 
doppler  processing  as  shown  in  Figure  1(e). 

It  can  be  shown  that  the  minimum  number  of  storage  locations  required  to 
accomplish  the  reformatting  operation  on  M  records  of  length  N  is 
(N  -  1)  (M  -  1).  However,  the  access  and  transfer  control  requirements  for  a 
CCO  memory  of  this  type  are  quite  complex.  Considerable  simplification 
results  from  the  use  of  an  M  x  N  array  (illustrated  in  Figure  1)  in  that  all 
access  and  transfer  operations  can  be  made  to  occur  either  sequentially  or  in 
parallel.  Because  of  the  nature  of  the  reformatting  operation,  this  type  of 
memory  is  referred  to  as  a  "corner  turning  memory"  (CTM). 

The  configuration  of  the  hypothetical  doppler  processor  subsystem  is 
shown  in  Figure  2.  A  dual  channel  analog  reformatting  memory  allows  in-phase 
(I)  and  quadrature  (Q)  video  components  from  a  coherent  radar  to  be  used  in 
conjunction  with  a  complex  CZT  processor  to  preserve  the  sense  of  the  doppler 
information  (approaching  vs  receding  targets).  One  particularly  crucial 
aspect  in  the  complex  doppler  processor  involves  gain  tracking  between  the  I 
and  Q  reformatting  memories.  To  ensure  that  the  doppler  image  response 
(having  doppler  sense  opposite  that  of  the  target)  is  sufficiently  attenuated, 
the  gain  between  I  and  Q  channels  must  continually  track  to  within  limits 
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given  by 


where  DR  is  the  desired  processor  dynamic  range  in  dB. 


Analog  CCD  reformatting  memories  have  been  recently  applied  in  high-speed 
doppler  processors  in  conjunction  with  surface  acoustic  wave  device  chirp 
transform  processors .3-6  These  memories  are  predominantly  line-addressed  CCD 
structures  utilizing  N  delay  lines  each  having  M  stages.  A  typical 
configuration  is  shown  in  Figure  3,  again  using  the  4  x  8  format  as  an 
example.  The  individual  CCD  input  and  output  circuits  are  commutated,  usually 
by  means  of  on-chip  MOS  circuitry. 


The  memory  is  loaded  in  a  row-by-row  fashion  by  commutating  the  input 
circuit  at  the  desired  sampling  rate.  After  each  row  has  been  entered,  it 
is  transferred  one  stage  toward  the  output.  When  the  memory  is  filled, 
delay  line  1  contains  the  samples  from  range  bin  1,  delay  line  2  contains 
those  from  range  bin  2,  etc.  Reformatting  is  accomplished  by  emptying  the 
array  column-by-column  using  the  output  commutator.  Since  data  cannot 
generally  be  entered  during  this  output  phase,  the  duty  cycle  of  this 
configuration  is  limited  to  501. 

Note  that  the  inverse  operation  is  possible  in  square  arrays  in  which 
data  can  be  entered  column-by-column  during  the  output  phase  of  one 
reformatting  operation,  then  emptied  row-by-row  during  the  input  phase  of  the 
next.  This  specific  case  results  in  100%  duty  cycle;  however,  the  key 
limitation  in  the  CCD  delay  line  approach  Is  the  fixed  pattern  gain  and  offset 
variations  that  occur  among  the  delay  lines  due  to  MOS  threshold  voltage 
variations  and,  to  a  lesser  extent,  CTE  variations. 6  In  the  doppler  processor 
application  these  result  in  gain  and  offset  variations  among  the  length  M 
output  records  in  the  column-output  mode  and  among  samples  in  each  record  in 
the  row-output  mode.  The  former  may  be  tolerated  in  some  applications; 
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Figure  3  Line  Addressed  Reformatting  Memory  Organization 


however,  the  latter  usually  results  In  an  intolerable  s I gna I - to-no i se 
degradat Ion. 

Based  on  these  architectural  considerations,  a  processor  brassboard  was 
developed  under  a  previous  contract  to  evaluate  the  configuration  Illustrated 
in  Figure  2  using  a  SAW  chirp  transform  module.  The  I  and  Q  components  of  the 
radar  video  are  to  be  derived  from  a  synchronous  detector  in  the  radar 
receiver  and  stored  in  separate  memory  (range  store)  modules.  Each  module 
consists  of  a  number  of  66  x  66  cell  CCD  memory  blocks  required  to  accommodate 
the  range  swath  of  interest.  For  the  brassboard  two  blocks  per  module  were 
employed,  although  this  is  expandable  to  allow  the  desired  coverage  of  the 
radar  pulse  interval.  Thus,  the  range  stores  in  the  brassboard  can 
accommodate  132  range  bins  (I  and  Q)  collected  over  66  PRIs  in  the  search  mode 
and  66  range  bins  ( I  and  Q  for  both  E  and  &  channels)  over  66  PRIs  in  the 
track  mode. 

The  range  stores  are  emptied  at  a  12.5  MHz  rate  into  I  and  Q  up 
converters  to  generate  single  sideband  doppler  information  at  a  318  MHz  IF  for 
subsequent  spectral  analysis  by  the  64-point  SAW  CZT  module.  (The  use  of  66 
cells  in  the  range  stores  allows  two  "burn-pulses"  for  settling  of  transients 
before  spectral  analysis  is  initiated.)  The  SAW  CZT  module  consists  of  two 
complex  CZT  channels  that  are  commutated  by  processor  timing  to  allow 
continuous  system  operation.  The  use  of  the  complex  CZT  realization  is 
required  to  maintain  the  desired  phase  information. 

The  organization  of  the  CCO  memory  chip  is  illustrated  by  the  block 
diagram  in  Figure  4.  The  configuration  shown  consists  of  sixty-six  66-stage 
CCO  delay  lines  and  the  circuitry  required  to  address  individual  input  and 
output  terminals  on  a  line-by-line  basis.  The  corner  turning  operation  is 
performed  by  sequentially  switching  between  a  parallel  mode  in  which 
consecutive  samples  are  read  into  and  out  of  consecutive  delay  lines  and  a 
serial  mode  in  which  consecutive  samples  are  read  into  or  out  of  one 
particular  delay  line. 
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The  design  approach  used  in  the  range  stores  is  based  on  the  two-phase 
CCD  structure,  which  can  be  realized  with  the  double-levei  polysilicon  gate 
process  developed  at  Tl.  These  delay  lines  are  operated  in  the  so-called 
"phase  and  one-half"  mode  in  which  one  set  of  gate  electrodes  is  clocked, 
while  the  other  is  maintained  at  an  intermediate  dc  level.  By  properly 
designing  the  output  circuit,  the  charge  transfer  and  sensing  operations  can 
be  controlled  with  a  single  clock  pulse.  When  the  CCD  clock  is  off,  the 
signal  charge  is  stored  in  i on- imp  1  an t ed  potential  wells  beneath  the 
nonclocked  electrodes. 

Referring  to  Figure  4 ,  a  logic  "1"  is  entered  at  the  input  of  the 
66-stage  MOS  shift  register  and  propagates  through  the  structure  sequentially 
addressing  MOS  switch  transistors,  which  coupl?  individual  CCD  clock  lines  to 
the  CCD  clock  buses.  Serial  or  parallel  operation  is  determined  by  the  shift 
register  clock  rate.  Signal  charge  is  transferred  down  the  CCD  delay  lines 
and  detected  at  individual  output  circuits.  The  commutating  shift  register  is 
a  standard  dynamic  MOS  ratio-less  configuration  with  an  additional  transfer 
switch  connected  to  the  CCD  clock  switch  transistor  that  operates  in  a 
bootstrap  mode. 

The  CCD  delay  lines  are  each  66-stage  buried  channel  devices  having  6  mil 
(152.4  pm)  channel  widths  and  0.6  mil  (15.24  pm)  gate  lengths.  The  input 
circuits  are  dual-gate  surface  channel  configurations  and  can  be  operated  in 
either  the  "diode  cutoff"  or  "potential  equilibration"  mode.  The  buried 
channel  is  coincident  with  the  edge  of  the  implanted  well  under  the  first 
clocked  electrode. 

To  preserve  dynamic  range  at  the  clock  rates  required,  it  was  determined 
that  each  delay  line  should  have  an  individual  source-follower  output 
transistor.  Furthermore,  power  dissipation  requirements  dictated  that  each 
output  transistor  should  be  turned  off  when  not  in  use.  This  was  accomplished 
with  an  additional  reset  transistor  at  the  output  of  each  delay  line  end  the 


use  of  external  current  summing  in  the  output  circuit.  As  indicated  in 
Figure  5,  all  output  nodes  are  tied  in  parallel  and  connected  to  the  emitter 
of  a  PNP  common  base  current  summing  amplifier.  The  value  of  Vg  sets  the 
voltage  at  the  emitter  and  hence,  the  common  output  node  voltage.  This  is 
adjusted  such  that  the  preset  level  at  the  output  diode  of  the  currently 
addressed  delay  line  is  sufficient  to  turn  on  its  respective  source  follower. 

The  collector  current  of  the  summing  amplifier  then  follows  the  drain 
current  of  the  output  follower  and  is  sensed  by  R|_*  Coincident  with  the 
operation  of  the  shift  register,  a  reset  pulse  is  applied  to  the  gate  of  the 
additional  reset  transistor.  This  pulls  the  output  diode  voltage  at  all  delay 
lines  down  to  VReset,  which  is  adjusted  to  a  level  below  the  emitter 
voltage  of  the  summing  amplifier  (VReset  =  Vg  is  sufficient),  thereby 
turning  off  a  1 1  output  source-followers.  The  next  preset  operation  turns  on 
the  output  circuit  of  the  next  delay  line,  and  the  output  voltage  follows  the 
source-follower  drain  current  until  the  next  reset  pulse  occurs. 

Figure  5  is  a  photomicrograph  of  the  completed  memory  chip.  Due  to  size 
limitations,  the  structure  is  composed  of  thirty-three  66-stage  shift  register 
addressed  delay  lines.  An  integral  source-follower  is  attached  to  the  last 
stage  of  the  shift  register  to  couple  it  to  a  second  bar,  thereby  forming  the 
full  66  x  66  memory  array.  The  dimensions  of  the  chip  are  156  mils  x  293  mils 
(3.96  mm  x  7.AA  mm) . 

Figure  6  illustrates  the  operation  of  two  interconnected  memory  chips 
performing  time  compression  of  an  analog  test  signal  at  a  12.5  MHz  clock  rate. 
The  top  two  traces  are,  respectively,  the  test  signal  and  memory  output 
(before  samp  1 e -and- ho 1 d )  on  the  same  time  scale.  The  test  signal  is  sampled 
during  the  parallel  mode,  which  can  be  di/ferent iated  from  the  serial  mode  by 
level  variations  In  the  latter  due  to  thre^K^Jd  voltage  variations.  The 
amplitude  of  this  fixed  pattern  noise  component  is  seen  to  be  of  the  same 
order  as  the  peak-to-peak  signal  swing.  In  the  doppler  processor  output  these 
variations  appear  in  the  "0"  doppler  cell  and  can  be  externally  inhibited. 


SEQUENTIALLY  LINE  ADORES 
OPERATION  AT  12.5  MHz 


Figure  6  Sequentially  Line  Address, 
at  12.5  MHz 
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The  primary  performance  limitation  observed  in  the  system  tests  resulted  from 
gain  variations  between  I  and  Q  channels,  which  are  related  to  the  threshold 
voltage  variations  through  the  transconductance  (gm)  of  the  individual  output 
source-followers.  These  gain  variations  were  observed  to  be  in  the  range  of  3 
to  5$  and  resulted  in  poor  suppression  of  the  doppler  image  response. 

The  limitations  imposed  by  MOS  threshold  variations  are  inherent  in 
line-addressed  analog  memories  and  stimulated  the  development  of  a  unique 
approach  to  the  problem.  Since  the  reformatting  operation  consists  in 
interchanging  row  and  column  transfer  of  the  stored  data  sequence,  the  most 
direct  realization  of  a  CCD  reformatting  memory  involves  a  structure  in  which 
charge  can  be  transferred  either  vertically  (row-by-row)  or  horizontally 
(column-by-column)  in  the  array. 7  The  structures  controlling  the 
two-dimensional  charge  transfer  operation  are  complicated,  and  the  potential 
advantages  of  this  direct  realization  are  lost  in  the  higher  risks  associated 
with  the  sophisticated  design.  In  addition,  there  is  no  performance  advantage 
over  line-addressed  structures  if  individual  input  and  output  cells  are 
employed  at  the  edge  of  the  array.  However,  the  two  dimensional  transfer 
approach  offers  the  potential  for  substantially  improved  performance  with  the 
integration  of  a  CCD  multiplexer  and  demultiplexer  to  provide  serial 
interfaces  with  the  memory  array.  A  block  diagram  of  the  structure  appears  in 
Figure  7.  The  input  demultiplexer  performs  a  ser i al -to-para 1  lei  conversion 
and  loads  the  memory  array  row-by-row.  During  the  load  cycle,  charge  is 
transferred  vertically  in  the  array.  When  all  rows  have  been  loaded,  the 
array  is  switched  to  a  horizontal  transfer  mode,  and  the  output  multiplexer 
performs  the  required  paral lel-to-ser ial  conversion  of  the  data  stored  in  each 
column.  The  resulting  CCD  structure  has  a  single  input  port  (voltage-charge 
conversion)  and  a  single  output  port  (charge-voltage  conversion).  All 
intermediate  data  manipulations  involve  CCD-type  charge  transfers.  Thus,  the 
sources  of  the  fixed  pattern  variations  that  cause  degraded  dynamic  range  in 
the  line-addressed  structures  have  been  eliminated,  and  full  dynamic  range  is 
expected  over  the  entire  reformatted  data  array  (MN  samples). 
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In  square  arrays  (M  ■  N)  an  additional  demultiplexer  can  be  added  at  the 
left  of  the  memory  array  in  Figure  7  and  an  additional  multiplexer  added  at 
the  bottom  to  allow  simultaneous  loading  and  unloading.  Full  duty  cycle  is 
thereby  achieved;  in  addition,  the  array  can  be  operated  as  a 
ser i a  l -para  1  1  el -ser  i  a  1  (SPS)  delay  line.  The  latter  feature  greatly 
simplifies  device  evaluation. 

The  primary  objective  of  the  program  executed  under  Contract  No. 
F19628-77-C-0234  is  the  development  of  a  prototype •  ana  1  og  CCD  reformatting 
memory  using  two-dimensional  charge  transfer  cells  and  integral  CCD 
multiplexers  and  demultiplexers.  A  secondary  objective  is  the  development  of 
CCD  process-compatible  bipolar  transistors  for  use  in  high-speed  interface 
circuits.  Three  chip  designs  were  executed  during  the  program: 

•  32  x  32  element  prototype  CTM 

•  64  x  64  element  prototype  high-speed  CTM 

•  Bipolar  circuit  test  chip. 

The  32  x  32  design  was  accomplished  to  evaluate  the  basic  memory 
structure.  Design  and  evaluation  are  discussed  in  Sections  II  and  III, 
respectively.  The  64  x  64  element  design  was  improved  to  accommodate  higher 
operating  speeds  and  is  discussed  in  Sections  IV  and  V.  Design  and  testing  of 
the  bipolar  chip  are  discussed  in  Sections  VI  and  VII. 


SECTION  1  1 

32  x  32  ELEMENT  CTM  DESIGN 


A.  Two-Dimensional  Charge  Transfer  Cell 

All  basic  electrode  arrangements  for  two-dimensional  charge  transfer 
arrays  employing  two-,  three-,  or  four-phase  clocking  schemes  were  described 
by  Sequin  in  197^.7  The  subject  seems  to  have  been  only  of  academic  interest, 
since  at  the  onset  of  the  program  the  construction  and  operation  of  a  true 
two-dimensional  charge  transfer  array  had  not  been  described  in  the 
literature.  The  first  design  task,  therefore,  involved  selection  of  a 
topology  that  was  consistent  with  available  fabrication  processes. 

Consideration  of  the  number  of  independent  clock  phases  required  for 
two-dimensional  charge  transfer  immediately  suggested  the  use  of  the 
"two-phase"  CCD  process  that  was  developed  at  Texas  Instruments  for  production 
of  high  density  CCD  digital  memories.  The  salient  features  of  this  process 
include: 

•  two  polysilicon  gate  levels 

•  self-aligned  CCD  well  implants 

•  self-aligned  source/drain  diffusion 

•  buried  channel  CCD  capability. 

The  first  2- D  transfer  scheme  considered  was  a  four-phase  approach  using 
diagonally  interconnected  electrodes.  The  topology  and  operation  of  this 
structure  are  illustrated  in  Figure  8,  which  shows  horizontal  transfer. 
Vertical  charge  transfer  is  accomplished  by  interchanging  the$i  and<?i  clock 
waveforms.  A  four-phase  clock  system  is  required,  and  the  array  can  (in 
principle)  be  realized  with  no  intralevel  interconnections  in  the  charge 
transfer  region.  Unfortunately,  inclusion  of  multiplexers  and  demultiplexers 
at  the  edges  of  the  array  obviate  this  apparent  advantage,  since  the  diagonal 
interconnections  cannot  be  extended  beyond  the  edges  of  the  array. 
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The  primary  disadvantage  of  this  approach  is  the  complexity  required  at 
the  multiplexer  and  demultiplexer  interfaces.  Note  that  charge  is  entered  at 
the  left  edge  of  the  array  at  three  different  times  during  one  clock  period. 
Charge  packets  from  a  CCD  demultiplexer  will,  however,  appear  simultaneously 
at  the  interface.  Thus,  an  extra  buffer  stage  is  required  between  the 
demultiplexer  and  the  memory  array  to  accommodate  the  timing  differences. 
Similar  problems  exist  at  the  array-multiplexer  interface. 

To  simplify  these  critical  interfaces,  a  more  s t r a i gh t f orwar d 
two-dimensional  cell  design  was  adopted  that  relies  on  interlevel 
interconnects  within  the  array.  The  structure  is  similar  to  a  three-gate 
level  design  discussed  by  Sequin  [Reference  7,  Figure  5(b)]  and  is  shown 
pictorial ly  in  Figure  9.  The  cell  consists  of  two  interleaved  two-phase  CCDs 
that  share  a  common  phase.  The  V  and  H  transfer  gates  are  both  first-level 
polysilicon  electrodes.  The  H  gates  are  continuous  polysilicon  in  the 
vertical  direction,  and  columns  are  interconnected  with  metal.  The  V  gates 
rely  solely  on  metal  interconnects.  The  common  phase  is  second-level 
polysilicon  that  completely  covers  the  array  except  for  the  openings  required 
for  contact  to  the  first-level  gates.  The  hatched  areas  indicate  the  barrier 
regions  in  the  two-phase  transfer  stages.  Note  that  horizontal  and  vertical 
transfers  are  accomplished  simply  by  enabling  the  corresponding  clock 
waveform.  Further,  all  charge  packets  enter  or  leave  the  array  simultaneously, 
thereby  allowing  the  interface  to  be  accomplished  with  a  single  transfer  gate 
between  demultiplexer  and  array  and  another  between  array  arid  multiplexer. 

Since  both  topologies  require  intralevel  interconnections  within  the 
array,  the  three-phase  transfer  cell  was  identified  as  exhibiting  the  lowest 
overall  technical  risk.  The  remainder  of  this  section  describes  the  design 
considerations  applied  to  the  realization  of  the  32  x  32  CTM  using  the 
three-phase  transfer  cell. 


Figure  10  shows  the  channel  stop,  gate,  and  contact  levels  for  the 
three-phase  transfer  cell  designed  to  be  compatible  with  the  design  rules  for 


Figure  9  Topology  and  Clocking  Scheme  for  the 
Three-Phase  Transfer  Ceil 
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the  "two-phase"  CCO  process.  Critical  feature  dimensions  are  labeled  A 
through  E  and  are  discussed  below.  The  horizontal  (H),  vertical  (V),  and 
common  phases  are  indicated  for  comparison  with  Figure  9. 

The  minimum  center-to-center  spacing  (pitch)  of  the  transfer  cells  is 
dictated  by  the  process  design  rules  that  are  set  by  photomask  geometry  and 
alignment  tolerances.  This  distance  can  be  determined  from  Figure  10  and  is 
given  by  P  *  B  +  E  +  2(A  +  C  +  0),  where 

A  *  minimum  intralevel  poly  separation  (0.3  mil)  7.6  pm 
B  *  minimum  poly  line  width  (0.3  mil)  7.6  pm 
C  *  minimum  poly-channel  stop  overlap  (0.2  mil)  5.1  pm 
D  =  minimum  contact-poly  separation  (0.2  mil)  5.1  I* 

E  *  minimum  contact  dimension  (0.3  mil)  7.6  pm 

The  resulting  pitch  determined  by  the  "two-phase"  design  rules  is  P  *  2.0 
mils  (50.8  pm).  Smaller  values  of  P  could  be  achieved  at  the  expense  of 
reduced  yield;  however,  for  the  purpose  of  designing  this  prototype  structure 
the  design  rules  were  strictly  observed. 

Figure  11  is  a  photomicrograph  of  the  completed  chip  showing  the  details 
of  the  two-dimensional  charge  transfer  cell.  The  orientation  is  the  same  as 
in  Figure  10,  and  the  metal  interconnections  among  the  individual  V  transfer 
gates  and  the  H  gate  column  can  be  seen.  Cell  Interconnects  are  redundant  to 
provide  a  measure  of  protection  against  a  few  open  metal  lines. 

B.  Input  Demultiplexer 

The  input  demultiplexer  accomplishes  the  ser  ial-to-paral  lei  conversion 
required  to  load  the  signal  charge  packets  into  the  memory  array.  In  its 
simplest  form  a  CCO  demultiplexer  consists  of  a  serial  shift  register  having 
an  output  tap  on  each  stage.  The  input  demultiplexer  for  the  CTM  is  more 
complex  due  to  constraints  imposed  by  the  desired  charge-coupled  Interface 
with  the  memory  array: 
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Figure  II  Photomicrograph  of  the  Two-Dimensional 
Transfer  Cell 


•  The  pitch  of  the  shift  register  must  match  that  of  the 
two-dimensional  transfer  cells.  This  requirement  is 
inconsistent  with  maintaining  good  charge  transfer  effi¬ 
ciency  (CTE)  in  the  shift  register  and  requires  the  use 

of  a  four-phase  delay  stage  having  0.5  mil  gate  length 
per  phase.  To  further  improve  the  CTE  capabilities  of 
the  shift  register,  each  phase  includes  a  depletion  well 
implant  that  results  in  a  0.2  mil  barrier  length.  In  addi¬ 
tion,  the  entire  channel  is  implanted  to  operate  in  a 
buried  channel  mode. 

•  The  charge-coupled  interface  requires  that  the  parallel 
transfer  be  accomplished  by  means  of  contiguous  polysilicon 
transfer  gates.  This  complicates  the  layout  due  to 
intralevel  polysilicon  separation  considerations. 

Figure  12  shows  the  channel  and  gate  levels  of  the  resulting 
demultiplexer  design.  Parallel  transfer  is  accomplished  by  inhibiting  the  03 
clock  and  enabling  the  XFR  gate.  Charge  stored  beneath  the  02  electrode  is 
then  transferred  to  the  XFR  gate  and  subsequently  to  the  second-level  gate  in 
the  two-dimensional  transfer  cell.  The  interface  between  the  02  ant)  XFR 
electrodes  is  slanted  to  accommodate  intralevel  gate  separation  requirements 
to  provide  increased  gate  width  and  to  minimize  charge  trapping  in  the 
demultiplexer,  which  could  degrade  CTE  during  the  serial  load  operation.  A 
photomicrograph  of  the  completed  chip  showing  the  demu 1 t iplexer -memory  array 
interface  appears  in  Figure  13.  Note  that  incomplete  charge  transfer  in  the 
parallel  transfer  operation  results  only  in  an  apparent  attenuation  of  the 
signal,  so  long  as  the  charge  remaining  the  the  demultiplexer  is  completely 
cleared  during  the  subsequent  load  operation. 

The  serial  input  circuit  consists  of  an  input  diode  and  two  surface 
channel  gates.  A  standard  CCD  output  circuit  consisting  of  a  reset  amplifier 
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and  a  source-follower  output  buffer  is  included  in  each  demultiplexer  to  fully 
characterize  the  performance  of  the  structure. 


C.  Output  Multiplexer 

The  function  of  the  output  multiplexer  is  to  perform  a  paral lel-to-ser iai 
conversion  on  the  32  signal  charge  packets  that  are  simultaneously  shifted  out 


of  the  memory  array.  In  addition  to  the  interface  consideration  applied  to 
the  demultiplexer  design,  the  multiplexer  interface  must  include  a  means  for 
clearing  of  any  residual  charge  following  the  parallel  transfer.  This 
function  is  inherent  in  the  operation  of  the  demultiplexer,  and  its  neglect  in 
the  multiplexer  can  lead  to  a  severe  degradation  of  the  apparent  CTE  of  the 
memory  array.  The  shift  register  portion  of  the  multiplexer  structure  employs 
the  same  four-phase  buried  channel  scheme  used  in  the  demultiplexer. 


The  channel  and  gate  levels  of  the  multiplexer  structure  are  illustrated 
in  Figure  14.  The  parallel  transfer  is  accomplished  between  an  extended 
second-level  memory  array  electrode  and  the  <t> 2  multiplexer  gate  by  means  of  a 
short  first-level  transfer  gate.  As  in  the  demultiplexer,  the  interface  is 
slanted  to  accommodate  intralevel  gate  spacing  and  to  provide  maximum  channel 
width.  An  individual  drain  diode  is  included  in  each  multiplexer  cell  and  is 
connected  to  the  extended  output  electrode  by  means  of  a  short  tab  added  to 
the  adjacent  first-level  memory  array  gate.  The  diode  is  reverse-biased  with 
a  positive  dc  voltage,  and  any  residual  charge  under  the  extended  second-level 
electrode  is  drained  when  the  adjacent  first-level  electrode  is  clocked  to  a 
high  level  prior  to  each  parallel  transfer.  A  photomicrograph  of  the 
completed  chip  showing  the  output  mu  1 1 ip  lexer -memory  array  interface  appears 
in  Figure  15. 


Each  multiplexer  includes  a  serial  "fat  zero"  input  circuit,  which  is 
also  used  to  characterize  the  serial  shift  register.  The  two  multiplexer 
channels  are  joined  in  a  c ommon  output  diode,  and  charge  sensing  is 
accomplished  with  a  single  reset  amplifier. 


26 


Figure  15  Photomicrograph  of  the  Multiplexer  -  Memory 
Array  Interface 
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D.  32  x  32  CTM  Chip  Organization 

Figure  16  is  a  photomicrograph  of  the  completed  32  x  32  CTM  chip.  The 
leads  required  to  make  connection  to  the  memory  array  clocks,  transfer  gates, 
and  diode  drain  were  routed  between  the  input  and  output  circuits  of  the 
multiplexer  and  demultiplexers.  Due  to  these  interconnection  requirements,  it 
was  not  possible  to  incorporate  a  single  input  circuit  for  use  with  both 
demultiplexers.  The  dc  offset  between  the  individual  demultiplexer  input 
circuits  (due  to  threshold  voltage  variations)  is  corrected  using  external 
dc-level  adjustments.  The  die  dimensions  are  112.5  mils  x  114  mils  (2.86  mm  x 
2.90  mm)  and  includes  47  bond  pads.  In  practice,  the  multiplexer  clocks  and 
02,  end  04  demultiplexer  clocks  are  bonded  in  parallel  across  the  chip, 
thereby  allowing  the  use  of  a  standard  40-pin  package. 

Experimental  results  are  presented  in  the  following  section. 
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Figure  16  Photomicrograph  of  the  32  x  32  Element  Reformatting 
Memory  Chip 


SECTION  III 

32  x  32  CTM  EVALUATION 


Three  lots  (12  slices/lot)  of  32  x  32  CTMs  were  processed  during  the 
program.  Significant  experimental  results  from  each  lot  are  discussed  in  this 
section.  Device  evaluation  was  accomplished  using  an  exerciser  that  was 
designed  and  constructed  to  allow  parallel  clocking  of  two  CTM  chips,  thereby 
allowing  dual  channel  (I  and  Q)  operation.  Timing  diagrams  and  circuit 
schematics  for  the  exerciser  are  included  in  Appendix  A.  The  clock  generation 
circuits  were  constructed  using  Schottky  TTL,  and  the  clock  driver  circuits 
employed  integrated  TTL-to-MOS  converters.  The  maximum  clock  rate  is  limited 
to  about  7  MHz  by  capacitive  loading  of  the  clock  drivers. 

A.  Lot  501 

Completed  slices  of  the  first  lot  were  received  in  March  1978,  and 
several  test  chips  containing  the  three-phase  CTM  structure  were  bonded  in 
40-pin  packages.  Initial  tests  indicated  a  high  incidence  of  intralevel 
shorts  on  the  second  polysilicon  gate  level.  No  completely  operational 
chips  were  found,  although  it  was  apparent  that  the  CCD  multiplexers  and 
demultiplexers  were  operational  on  a  number  of  devices. 

Further  investigation  revealed  that,  due  to  a  mixup  in  process 
specifications,  only  half  the  lot  (Slices  7-12)  were  fabricated  using  a  new 
polysilicon  etch  technique  developed  to  eliminate  intralevel  shorts.  It  was 
originally  intended  that  this  technique  would  be  used  exclusively  for  the 
entire  lot.  Subsequently,  devices  from  Slices  9  and  10  were  bonded  and 
tested,  and  a  number  of  operational  units  were  observed.  Successful  operation 
was  demonstrated  at  a  program  review  on  14  April  1978. 

In  the  course  of  further  testing  it  was  found  that  one  CCD 
demultiplexer  (at  the  top  of  the  structure  In  Figure  16)  was  not  operational 
as  a  result  of  a  minor  design  error.  Figure  17  illustrates  5  MHz  operation  of 
the  32  x  32  memory  in  the  corner-turning  mode,  in  which  the  memory  is  filled 
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(a) 


(b) 


Figure  1 7 


Memory  Operation  in  the  Corner-Turning  Mode  at  a  5  MHz 


Output  100  mV/di v 


Input  l  V/div 
Horizontal  50  psec/div 


Expanded  Output 
100  mV/div 


Horizontal  5  psec/div 


Clock  Rate 
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column-by-column  by  the  demultiplexer  at  the  left,  then  emptied  row-by-row 
through  the  multiplexer  at  the  bottom  of  the  array.  Figure  17(a)  shows  the 
triangular  test  waveform  and  CTM  output  on  the  same  time-scale  and  illustrates 
a  50%  duty  cycle  in  the  memory  due  to  operation  with  a  single  demultiplexer. 
Figure  17(b)  shows  an  expanded  view  of  the  CTM  output  and  illustrates  the  32X 
time  compression  of  the  test  waveform  due  to  the  corner-turning  operation.  The 
fine  structure  in  Figure  17(b)  occurs  primarily  because  of  the  feedthrough  of 
various  clock  pulses;  however,  a  significant  fixed  pattern  noise  component  was 
observed  and  is  discussed  later  in  this  report.  The  dock  rate  in  the 
multiplexer  and  demultiplexer  is  5  MHz,  and  the  signal  attenuation  observed  is 
due  partly  to  a  mismatch  in  input  and  output  node  capacitances  and  partly  to 
the  low  voltage  gain  of  the  source-follower  output  amplifier. 

The  memory  was  also  tested  in  an  SPS  mode  in  which  operation  is  the  same 
as  a  1024-stage  delay  line.  In  this  case,  however,  each  charge  packet 
experiences  only  192  transfers.  Results  of  the  SPS  mode  tests  are  shown  in 
Figure  18.  The  1024-stage  delay  (204.8  Ms  at  the  5  MHz  clock  rate)  is 
illustrated  in  Figure  18(a),  while  Figure  1 8 ( b )  shows  the  response  to  a  square 
pulse  that  is  commonly  used  to  evaluate  CTE.  In  this  case,  however,  the  CTE 
measurement  is  complicated  by  the  fixed  pattern  noise  component,  and  a 
worst-case  value  of  0.9998  is  based  on  the  noise  level  observed.  To  isolate 
the  fixed  pattern  noise  component,  it  is  first  necessary  to  eliminate 
feedthrough  of  the  low  duty  cycle  clock  pulses.  Two  techniques  were  employed 
to  accomplish  this  goal. 

The  first  technique  employed  was  an  ac-coupled/dc-restored  amplifier 
(Figure  19)  at  the  memory  output.  As  indicated  in  the  figure,  the  restore 
switch  is  closed  briefly  during  the  period  immediately  following  the 
multiplexer  output  node  precharge  operation,  and  the  sample  switch  is 
similarly  pulsed  following  transfer  of  the  signal  charge  packet.  Thus,  level 
translations  at  the  output  node  that  are  present  during  both  restore  and 
sample  operations  (such  as  those  due  to  stray  coupling  of  clock  pulses)  are 
inhibited;  however,  stray  transitions  that  occur  between  application  of  the 
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restore  and  sample  pulses  are  not  affected.  The  use  of  this  circuit  made  it 
possible  to  inhibit  the  feedthrough  of  the  low  rate,  two-dimensional  memory 
clock  pulses,  but  not  the  feedthrough  due  to  the  multiplexer  and  demultiplexer 
parallel  transfer  pulses. 

The  resulting  output  waveforms  for  both  the  CTM  and  SPS  modes  are 
illustrated  in  Figure  20.  Comparison  of  Figure  20(a)  (one  complete  CTM  cycle 
1024  samples)  and  Figure  20(b)  (one  SPS  output  burst  -  32  samples)  indicates  a 
high  degree  of  correlation  in  fixed  pattern  noise  components  due  to  the 
parallel  demultiplexer  transfer.  (The  slant  on  the  CTM  response  is  due  to 
dark  current  integration  at  the  relatively  low  clock  rate  employed.)  Figure 
20(c)  shows  the  individual  32  sample  bursts  in  the  CTM  mode  and  indicates  that 
there  is  much  less  fixed  pattern  noise  associated  with  the  parallel 
multiplexer  transfer.  The  large  spurious  samples  in  Figure  20(c)  are  due  to 
coupling  of  the  multiplexer  transfer  pulse,  as  mentioned  earlier.  The 
largest  component  of  fixed  pattern  noise  is  clearly  generated  in  the  parallel 
demultiplexer  transfer. 

To  completely  eliminate  the  possibility  of  clock  pulse  feedthrough,  a  new 
timing  circuit  was  designed  and  constructed.  Timing  diagrams  and  circuit 
schematics  are  included  in  Appendix  A.  By  lowering  the  duty  cycle  of  the 
memory  to  25%,  the  control  pulses  can  be  made  to  occur  between  32-sample  data 
strings,  rather  than  simultaneously  with  them.  Resulting  fixed  pattern  noise 
components  for  two  CTM  chips  operating  at  two  clock  rates  are  shown  in  Figures 
21  and  22.  In  this  case  post-amplification  was  employed  to  achieve  unity  gain 
through  the  memory.  The  maximum  signal  excursion  is  approximately  2  V 
peak- to-peak .  The  fixed  pattern  noise  components  observed  in  the  figures 
(neglecting  dark  current  integration)  result  in  dynamic  range  figures  of 
approximately  36  dB  on  a  frame  basis  (1024  samples)  and  better  than  40  dB  on  a 
line  basis  (32  samples)  for  chip  No.  11.  Note  that  the  fixed  pattern  noise 
distributions  are  not  correlated  between  chips  and,  except  for  dark  current, 
are  insensitive  to  clock  rate.  These  results  eliminate  dark  current  spikes  in 
the  memory  array  and  charge  redistribution  time  constants  in  the 
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(a)  CTM  Mode 
5  mV/div 
200  ps/div 


(b)  5PS  Mode 

Top;  Memory  Output 

Waveform,  2  mV/div 

Bottom:  Multiplexer  Transfer 
Pulse,  5  V/div 
Horizontal:  10  ps/div 


(c)  Detai I  of  CTM  Mode 
5  mV/div 
20  ps/div 


Both  CTM  and  SPS  Modes 


Figure  22  Fixed  Pattern  Noise  Observed  for  Chip  #9  (SI. 9).  Clock  retes 


serial-parallel  and  parallel-serial  interfaces  as  possible  sources  of  the 
fixed  pattern  component.  Both  would  result  in  a  clock  rate  dependence  of  the 
noise. 

Similar  demultiplexer-related  effects  have  been  observed  in  digital  CCD 
memories  employing  SPS  structures.  A  recent  explanation^  is  based  on  the 
reduction  of  the  channel  width  of  the  receiving  well  due  to  the  effect  of 
two-dimensional  fringing  fields.  This  is  not  a  reasonable  explanation  for  the 
effect  observed  in  the  CTN  because  both  the  serial  register  gate  and  the 
receiving  well  gate  are  independently  clocked.  No  dependence  of  fixed  pattern 
noise  on  clock  amplitude  (in  excess  of  a  minimum  value  required  to  initiate 
charge  transfer)  has  been  observed  in  the  CTM. 

A  more  reasonable  explanation  for  the  effect  is  based  on  two-dimensional 
variations  in  surface  potential  under  the  ser ia I -reg i ster  gate  that  have 
dimensions  comparable  with  the  gate  width  and  larger  than  its  length.  These 
variations  can  be  caused  by  localized  variations  in  substrate  doping  or 
surface  state  density.  Appreciable  variations  in  both  quantities  are  known  to 
occur  and  are,  in  fact,  the  cause  of  typical  MOS  transistor  threshold  voltage 
variations.  Threshold  variations  between  adjacent  overlapping  gates  (<  1  mil 
cen ter - 1 o-cen ter )  in  CCD  multiplexers  give  rise  to  fixed  pattern 
channel-to-channel  offsets  typically  from  10  to  25  mV  in  amplitude.  Figure  23 
shows  a  simplified  view  of  the  surface  potential  at  the  ser ia I -para I le I 
interface  in  the  demultiplexer.  Although  most  of  the  signal  charge  is 
transferred  from  the  demultiplexer  <b\  electrode  to  the  transfer  electrode,  a 
fraction  remains  trapped  in  a  surface  potential  minimum  under  the  <t>\  elecrode. 
When  the  demultiplexer  0 2  electrode  is  clocked  to  a  high  level,  this  trapped 
charge  (or  the  fraction  trapped  in  minima  having  dimensions  larger  than  the 
gate  length  L)  will  be  transferred  from  the  0j  electrode  to  the  02  electrode 
and  hence,  to  the  demultiplexer  output  circuit  during  the  next  serial  input 
cycle.  The  minima  are  thereby  cleared  during  each  read-in  cycle  and  are 
"refilled"  at  each  parallel  transfer. 
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Note  that  the  same  conditions  do  not  exist  at  the  memory 
array-multiplexer  interface  (see  Figure  l1*).  The  effective  channel  width  of 
the  diode  drain  is  much  narrower  (0.3  mils)  and  probably  does  not  result  in 
complete  clearing  of  the  minima  that  must  also  exist  in  the  para  I  l  e  1 -ser  i  a  I 
transfer  stage.  These  minima  remain  "filled"  at  ail  times  and  thus  do  not 
affect  the  signal  charge  packets.  The  fixed  pattern  components  introduced  in 
the  multiplexer  should,  therefore,  be  smaller  than  those  introduced  in  the 
demultiplexer,  as  is  observed  in  Figures  20,  21,  and  22. 

An  approximate  value  for  the  amount  of  charge  trapped  under  the 
demultiplexer  electrode  can  be  calculated  by  assuming  a  linear  surface 
potential  gradient  sloping  away  from  the  parallel  transfer  gate  (as  in  Figure 
23)  and  having  a  peak-to-peak  variation  of  AV  *  25  mV.  The  area  of  the  well 
region  under  the  electrode  is  about  0.6  mils2.  The  gate  oxide  capacitance 
is  approximately  0.22  pF/mils2,  and  the  trapped  charge  is  approximately  nj  * 
Cgate  AV/2q  ~  10^  electrons.  The  peak  charge  capacity  is  determined  by 
the  area  and  "depth"  of  the  surface  potential  well  in  the  charge  storage 
region  of  the  two-dimensional  transfer  cell.  The  resulting  calculation  gives 
a  peak  charge  capacity,  nmax,  of  about  10^  electrons.  The  ratio 
nT/nmax  'S  on  the  order  of  1$,  resulting  in  a  dynamic  range  (peak  signal 
to  peak  noise)  of  about  40  dB,  which  is  very  close  to  the  observed  value. 

Reduction  of  the  fixed  pattern  offsets  introduced  by  this  mechanism  will 
directly  result  from  reduction  in  the  spatial  variations  in  substrate  doping 
and  surface  state  density.  Recent  evidence9  indicates  that  substrate  doping 
variations  are  the  dominant  cause  of  MOS  threshold  variations  and  that  special 
processing  of  the  starting  materials  can  reduce  the  threshold  variations  in 
proximate  devices  to  1  to  3  mV. 

•v 

The  offsets  can  also  be  reduced  by  decreasing  the  width  of  the 
demultiplexer  channel.  This  directly  reduces  the  quantity  of  trapped  charge 
but  also  increases  the  effect  of  the  lateral  fringing  fields  due  to  the 
parallel  transfer  gate. 
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B.  Lot  602 

The  buried  channel  and  metal  etch  masks  for  the  32  x  32  CTM  were 
reprocessed  in  order  to: 

•  Correct  the  error  in  the  demultiplexer  clock  line  to  achieve  100? 
duty  cycle  operation 

•  Correct  the  buried  channel  implant  mask  in  the  multiplexer  serial 
input  circuit 

•  Add  buried  channel  implant  to  the  reset  transistor  in  the  multi¬ 
plexer  output  circuit  to  permit  higher  operating  rates. 

Figure  24  illustrates  the  input/output  relationship  of  the  CTM  operating 
with  100?  duty  cycle  at  a  200  kHz  sample  rate.  Figure  25  shows  the  CTM  output 
on  an  expanded  time  base.  In  both  photographs,  the  precharge  levels  as  well 
as  signal  are  seen  in  the  output  waveform. 

Operation  of  the  corrected  CTM  at  5  MHz  is  illustrated  in  Figure  26. 
Operation  at  this  data  rate  is  significantly  better  than  operation  of  the 
original  lot  of  CTMs.  This  is  attributed  to  the  addition  of  a  buried  channel 
implant  to  the  output  precharge  circuit.  The  upper  trace  in  Figure  26  shows 
the  5  MHz  CTM  output,  while  the  lower  trace  shows  the  output  of  subsequent 
sample-and-hol d  circuitry,  which  eliminates  the  precharge  level  from  the 
waveform.  The  observed  fixed  pattern  noise  levels  were  the  same  as  seen  for 
the  devices  from  the  previous  lot. 

C.  Lot  620 

The  third  lot  of  32  x  32  CTMs  was  fabricated  using  high  quality  3  in. 
diameter  silicon  slices  known  to  have  a  more  uniform  oxygen  distribution  than 
the  typical  2  in.  slices  used  in  the  previous  lots.  This  new  material  has 
significantly  improved  the  performance  of  CCD  imagers  by  yielding  a  more 
uniform  dark  current  distribution.  It  was  anticipated  that  the  better  oxygen 
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Figure  26  5  MHz  32  x  32  CTM  Operation.  Horizontal  =  5  ps/div 
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uniformity  would  be  accompanied  by  a  more  uniform  substrate  doping  density 
resulting  in  smaller  fixed  pattern  offsets. 


Figure  27  shows  input  and  output  waveforms  for  memory  operation  at 
1  MHz  in  the  SPS  mode,  corresponding  to  a  1024-stage  shift  register.  Charge 
transfer  efficiency  (CTE)  has  been  computed  from  the  observed  step  response  in 
this  mode  shown  in  Figure  28  (assuming  that  it  is  dominated  by  demultiplexer 
and  multiplexer  transfers)  and  is  greater  than  0.9999  at  1  MHz. 


Operation  in  the  reformatting  mode  with  100%  duty  cycle  at  a  1  MHz  rate 
is  illustrated  in  Figure  29.  Figure  29(a)  shows  the  test  waveform,  which  is 
triggered  once  per  frame  ( 1 024  cycles).  Figure  29(b)  shows  memory  output  over 
two  successive  frames  and  demonstrates  the  full  duty  cycle  capabilities. 
Figure  29(c)  and  (d)  are  expanded  views  of  the  output  during  the  first  frame 
and  confirm  the  reformatting  operation  which,  in  this  case,  corresponds  to  a 
32x  time  compression  of  the  test  waveform. 

Figure  30  shows  the  memory  output  on  a  more  sensitive  scale  with  a  dc 
input  signal  and  illustrates  that  fixed  pattern  noise  has  been  reduced  by  6 
dB,  compared  to  the  previous  lots.  Figure  30(a)  corresponds  with  Figure  29(b) 
and  shows  a  line-to-line  fixed  pattern  offset  distribution  having  an  rms 
value,  which  is  approximately  43  dB  below  the  maximum  peak-to-peak  signal 
illustrated  in  Figure  29. 

Figure  30(b)  corresponds  with  Figure  29(b)  and  indicates  excellent 
fixed  pattern  noise  performance  within  each  line  (32  samples).  A  slight 
upward  tilting  of  the  waveform  can  be  discerned,  which  Is  due  to  a  leakage 
current  component.  The  tilt  is  caused  by  the  fact  that  each  consecutive 
sample  in  a  line  was  stored  In  the  two-dlmins  Iona  I  array  for  one  additional 
eye  1 e . 

An  investigation  of  spectral  distortion  due  to  nonlinearity  of  the 
input  sampling  scheme,  and  the  addition  of  fixed  pattern  noise  to  the  signal 
was  conducted  using  these  devices.  A  microprocessor-generated  32-byte  sine 
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Figure  28  Step  Response  Used  to  Determine 

CTE.  Vertical  20  mV/div,  Horizontal 
I  ps/div 
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(a)  INPUT 


Figure  29  Reformatting  Operation  of  the  Memory  at  1  MHz.  (a)  Input  waveform 
0.5  V/div.,  (b)  Memory  output  0.05  V/div.  Horizontal  Scale  200 
ns/div.  (c)  Memory  Output,  50  ps/div.  (d)  5  ps/div. 


(a)  SmV/DIV 


Figure  30  DC  Response  in  Reformatting  Mode  Showing  Fixed  Pattern  Noise 
Components.  (a)  Frame  to  Frame,  5  mV/div. ,  500  (is/div., 

(b)  Line  to  Line,  5  mV/div.,  5  us/div. 


wave  with  an  accuracy  of  eight  bits  was  phase-locked  to  the  CTM  timing  to 
produce  a  signal  source  with  low  harmonic  content,  as  shown  in  Figure  31. 
Ideally,  the  CTM  will  reformat  the  phase-locked  signal  to  reproduce  the 
32-byte  sine  wave  shifted  in  frequency  by  five  octaves;  i.e.,  the  output 
signal  fundamental  occurs  at  the  line  rate  of  the  device.  Harmonic  distortion 
and  fixed  pattern  contributions  at  the  output  can  then  be  evaluated  by 
comparing  the  resultant  spectral  components  with  the  signal  fundamental. 

Figures  32,  33,  and  show  the  results  obtained  with  an  output  signal 
magnitude  of  100  mV  peak-to-peak  at  a  250  kHz  sample  rate.  The  output 
fundamental  is  at  a  frequency  of  250  kHz  +  32  *  7 - 8 1  kHz.  Figure  32  shows 
that  the  second  harmonic  distortion  component  has  a  magnitude  less  than  -30  dB 
relative  to  the  fundamental,  with  the  magnitude  of  all  other  harmonics  less 
than  -40  dB.  The  large  second  harmonic  contribution  to  the  total  distortion 
is  characteristic  of  the  "fill  and  spill"  input  scheme  used  in  this  test.  The 
linearity  of  the  f i  1  I -and-sp i I  I  input  technique  depends  on  maintaining  a 
minimum  interval  between  "fill"  and  transfer  into  the  CCD.  This  interval  is 
fixed  at  25$  of  one  clock  period  by  the  exerciser  circuitry  and  leads  to 
higher  second  harmonic  distortion  than  is  ultimately  achievable  using  this 
input  technique.  Figure  33  shows  the  spectral  distribution  of  the  intraline 
fixed  pattern  noise,  which  contains  components  at  multiples  of  the  line  rate 
( 7 - 8 1  kHz).  By  removing  the  fat  zero  charge  in  the  device,  the  contribution 
of  clock  feedthrough  to  the  output  noise  can  be  measured  as  shown  in  Figure 
34.  If  the  clock  feedthrough  contributions  of  Figure  34  are  "subtracted" 
from  the  total  fixed  pattern  spectrum  shown  in  Figure  33,  the  remaining  noise 
components  represent  the  contributions  due  to  threshold  variations  that  would 
not  be  removed  from  the  output  signal  by  subsequent  filtering.  Care  must  be 
taken  in  the  evaluation  of  the  separate  contributions  that  add  to  the  total 
noise  spectrum  because  the  phase  relationship  between  these  components  is 
unknown.  However,  reasonable  estimates  of  the  individual  contributions  can  be 
made.  From  Figures  33  and  3** 
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Figure  31  Input  Signal  Spectrum 
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Figure  32  Spectrum  of  Reformatted  Data 
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Figure  33  Spectral  Components  of  Fixed  Pattern 
Noise  for  a  32  x  32  CTM  (Leftmost 
Spike  is  Zero  Hz  Marker) 


Figure  Ik  Spectral  Components  of  Clock  Feedthrough 
at  Output  of  32  x  32  CTM  (Leftmost  Spike 
is  Zero  Hz  Marker) 
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it  appears  that  the  worst  spectral  components  of  fixed  pattern  noise  that  can 
be  attributed  to  threshold  variations  are  at  least  -48  dB  relative  to  the 
signal  fundamental  shown  in  Figure  32.  For  comparison,  the  rms  thermal  noise 
level  at  the  CTM  output  (dominated  by  kTC  noise  generated  at  the  output  node) 
is  calculated  to  be  approximately  60  dB  below  the  peak  output  signal  level. 
The  need  for  further  reduction  in  the  fixed  pattern  noise  introduced  in  the 
input  demultiplexer  clearly  requires  further  effort  in  both  materials 
preparation  and  device  design. 
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SECTION  IV 
64  x  64  CTM  OESIGN 


Having  established  the  viability  of  the  two-dimensional  charge  transfer 
concept  for  the  CTM  device,  a  redesign  was  accomplished  to  achieve  a 
configuration  more  closely  aligned  with  the  original  program  objectives  (100  x 
100  array,  50  MHz  operation).  Consideration  of  potential  performance  versus 
technical  risk  led  to  the  decision  to  expand  the  CTM  structure  to  a  64  x  64 
element  array.  Although  the  same  basic  demultiplexer,  multiplexer,  and 
two-dimensional  transfer  cell  designs  were  employed,  a  few  minor  modifications 
were  incorporated  for  improved  performance. 


The  demultiplexer  channel  width  was  reduced  from  1.6  mils  (40.6  urn)  to 
0.8  mils  (20.3  urn)  to  improve  fixed  pattern  noise  performance.  A  new  output 
amplifier  circuit  was  included  to  enhance  high  frequency  performance.  A 
schematic  diagram  is  shown  in  Figure  35.  Specific  modifications  included: 


•  increasing  the  aspect  ratio  of  the  buried  channel  reset  transistor 
from  3/1  in  the  32  x  32  design  to  6.7/1 

•  incorporation  of  two-stage  source-follower  output  buffer  amplifier 
c i rcui t . 

With  a  500  £2  external  load  resistor  and  5  pF  load  capacitance,  the  -3  dB 
bandwidth  of  the  output  stage  is  predicted  to  be  greater  than  100  MHz  and  the 
on-chip  power  dissipation  is  less  than  100  mW. 

Figure  36  shows  a  photomicrograph  of  the  completed  chip.  The  die 
dimensions  are  180  mils  by  220  mils  (4.57  mm  x  5.59  mm).  Experimental  results 
are  discussed  in  the  following  section. 
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gure  35  Schematic  Diagram  of  the  High-Speed  Output  Circuit 
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SECTION  V 


64  x  64  CTM  EVALUATION  ' 

(' 

The  lot  of  64  x  64  CTMs  was  completed  in  April  1979.  Initial  | 

evaluation  indicated  a  very  low  yield  due  to  a  high  incidence  of  interlevel  i 

shorts  between  the  polysilicon  gate  levels.  The  remaining  devices  were  j 

manually  probed  to  eliminate  those  exhibiting  interlevel  shorts  among  the  | 

memory  array  clock  phases.  A  number  of  operational  units  were  obtained  from 
the  presorted  chips.  I 

P 

Figure  37  shows  the  response  of  a  64  x  64  CTM  operated  in  the  SPS  mode  at  r 

a  1  MHz  clock  rate  and  shows  the  resulting  4.096  ms  delay.  Figure  38  shows  f 

100%  duty  cycle  reformatting  operation  at  a  1  MHz  clock  rate.  The  dynamic  j 

i 

range  of  the  device  is  comparable  with  that  observed  for  the  32  x  32  element  l 

CTM  on  high  quality  starting  material.  Figure  39  shows  the  observed  fixed  f 

pattern  noise  in  the  reformatting  mode.  Figure  39(a)  shows  several  output 
fields  and  indicates  that  the  line-to-line  variations  are  less  than  1  mV  | 

peak-to-peak .  Figure  39(b)  shows  several  lines  of  the  output  waveform  and 
indicates  that  the  primary  fixed  pattern  component  is  due  to  dark  current  j 

integration.  The  amplitudes  of  the  higher  frequency  components  are  I 

substantially  smaller  than  1  mV.  j 

Figure  40(a)  shows  the  step  response  of  the  64  x  64  CTM  in  the  SPS  mode 
at  a  5  MHz  clock  rate  and  is  indicative  of  excellent  CTE.  Figure  4 0 ( b )  shows 
the  swept  spectrum  of  the  transfer  function  of  the  SPS  delay  line.  An 
estimate  of  the  CTE  can  be  obtained  from  the  departure  of  the  spectrum  from 
the  ideal  sin  ( rrf /f c ) /  (rtf/fc)  spectrum.  The  maximum  error  occurs  at  the 
Nyquist  rate  (fc/2)  and  a  conservative  estimate  of  CTE  ■  0.9999 
results.  ^ 

! 

These  devices  were  evaluated  using  a  timing  circuit  that  was  designed  for 
high  speed  operation  using  ECL  components.  Discrete  clock  driver  circuits  were 
designed  using  Schottky  diode  clamped  VHF  power  transistors.  The  timing  and 
driver  circuits  are  included  in  Appendix  A.  The  ECL  circuitry  was  capable  of 

58 


Input 

500  mV/di v 


B|  Output 

I  50  mV/div 

„  Horizontal 


37  SPS  Operation  of  the  64  x  64  CTM  at  1  MHz 
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Figure  38 


100%  Duty  Cycle  Reformatting  Operation  of  the  64  x  64  CTM 
at  1  MHz 
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Figure  39  Observed  Fixed  Pntt<rn  Noise  li  ve!',  in  the  64  x  64  CTM. 

(a)  Line-to-line  and  (b)  Sample-i  .-sample  in  one  line. 
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Figure  40  (a)  Step  Response  in  SPS  Mode  at  a  5  MHz  Clock  Rate.  (b)  Swept 

Spectrum  of  SPS  Delay  Line  Transfer  Function  from  Which  CTE  = 
0.9999  is  Estimated. 
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generating  the  necessary  waveforms  for  operation  in  excess  of  50  MHz  (200  MHz 
master  clock  rate).  However,  variations  in  propagation  delay  in  the  ECL-TTL 
converters  used  in  the  driver  circuits  limited  high  frequency  operation  to  a 
maximum  clock  rate  of  40  MHz. 

Figure  4l  shows  operation  in  the  reformatting  mode  at  a  25  MHz  clock 
rate.  Note  that  a  number  of  clock  "glitches"  are  discernible  on  the  output 
waveform. 

Fixed  pattern  noise  performance  at  25  MHz  is  shown  in  Figure  42.  Figure 
42(a)  shows  one  line  of  the  output  waveform  and  indicates  that  the  dominant 
noise  component  is  due  to  capacitive  coupling  of  the  memory  and  transfer  clock 
waveforms  to  the  output  node.  Figure  42(b)  shows  one  frame  of  the  output 
waveform  and  indicates  a  spurious  response  during  the  first  line  of  each 
field.  This  response  has  been  traced  to  coupling  of  the  demultiplexer 
transfer  pulse  to  the  demultiplexer  input  node.  The  remaining  fixed  pattern 
noise  appears  to  be  2  to  3  mV  peak-to-peak ,  which  is  36  to  40  dB  below  the 
peak  signal  level.  As  anticipated,  the  dynamic  range  is  slightly  reduced  at 
this  higher  clock  rate. 

Figure  43(a)  shows  the  response  of  the  64  x  64  CTM  to  a  triangular  test 
waveform  in  the  SPS  mode  at  a  40  MHz  clock  rate.  The  test  waveform  was 
triggered  at  the  frame  rate  so  that  the  102.4  us  delay  is  indicated  by  the 
interval  between  the  output  waveforms.  Figure  43(b)  shows  more  detail  of  the 
output  waveform  during  the  flat  portion  of  the  output  in  Figure  43(a)  and 
illustrates  the  complexity  of  the  waveform  due  to  coupling  of  the  multiplexer 
clock  pulses  to  the  output  node.  Note  that  further  sampling  of  this  waveform 
will  require  an  extremely  narrow  sampling  aperture  to  avoid  errors  caused  by 
timing  circuit  phase  noise  and  the  steep  slope  of  the  output  waveform  during 
the  signal  interval. 

Figure  44  shows  the  results  of  an  independent  measurement  of  the 
bandwidth  of  the  two-stage  source  follower  output  amplifier.  The  top  trace  is 
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Figure  43  Output  Waveforms  for  the  64  x  64  C.TM  in  the  SPS 
Mode  at  a  40  MH/  Clock  Rate 
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Figure  A4  Output  Amplifier  Frequency  Response.  Top 
Trace:  tracking  generator  output  level. 
Bottom  Trace:  amplifier  output. 
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the  output  of  a  tracking  generator,  which  is  ac-coupled  to  the  Vrest  node 
of  the  reset  switch  transistor.  The  dc  operating  point  was  established 
through  a  50  fl  resistor  and  the  gate  of  the  switch  transistor  was  maintained 
at  +15  V  to  ensure  operation  in  the  linear  region.  The  external  load  attached 
to  the  output  node  was  a  1 000 0  resistor  in  parallel  with  a  load  capacitance 
of  approximately  6  pF.  The  lower  trace  in  Figure  44  shows  the  response 
measured  with  a  FET  probe  at  the  output  pin.  The  voltage  gain  of  the 
amplifier  is  0.45  (-7  dB)  and  the  -3  dB  bandwidth  is  approximately  50  MHz. 
The  bandwidth  can  be  increased  by  reducing  the  value  of  the  load  resistor  and 
increasing  the  Vreset  level  to  increase  the  drain  current  in  the  output 
stage. 

All  of  the  operational  64  x  64  CTM  devices  eventually  failed  during  the 
high  frequency  tests.  The  last  operational  device  was  used  to  obtain  the 
photographs  shown  in  Figure  43  and  failed  before  the  reformatting  mode  could 
be  tested.  The  problem  appears  to  be  related  to  the  higher  peak-to-peak 
levels  of  the  clock  waveforms  that  were  required  for  successful  high-speed 
operation  and  to  the  questionable  integrity  of  the  interlevel  oxide  as 
suggested  by  the  low  initial  yield.  It  is  suspected  that  the  higher  clock 
levels  are  required  to  offset  voltage  drops  across  the  substrate,  which  can 
become  appreciable  at  high  clock  rates. 

Another  set  of  devices  (30)  was  bonded  during  the  preparation  of  this 
report.  One  "semioperat  iona  I"  device  was  identified,  and  it  failed  during 
initial  evaluation. 

Because  of  the  excellent  yield  obtained  for  the  32  x  32  CTM  lots,  it  is 
apparent  that  the  low  yield  observed  for  the  64  x  64  CTM  is  due  to  a 
processing  error.  Inspection  of  the  processing  history  of  this  lot  failed  to 
ncover  an  obvious  error.  It  is  anticipated  that  fabrication  of  a  second  lot 
(with  particular  attention  paid  to  the  deposition  of  the  interlevel  oxide) 
will  be  required  to  fully  evaluate  the  64  x  64  CTM  device. 
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SECTION  VI 


BIPOLAR  TEST  BAR  DESIGN 

At  the  design  review  held  in  April  1978  it  was  decided  that  the  MESFET 
amplifier  development  discussed  in  the  proposal  would  be  replaced  with  the 
development  of  a  process-compatible  bipolar  device.  The  results  of  an 
exploratory  development  program  indicated  the  following  potential  advantages 
of  a  bipolar  emitter-follower  CCD  output  stage: 

•  Lower  CCD  output  mode  capacitance,  resulting  in  significant 
voltage  gain  as  compared  to  present  output  techniques. 

•  Lower  amplifier  output  impedances,  resulting  in  potentially 
higher  output  circuit  bandwidths. 

•  Elimination  of  the  precharge  operation  (in  some  applications), 
a  considerable  simplification  for  high  speed  operation. 

These  advantages  are  discussed  in  more  detail  in  the  remainder  of  this 
section,  following  a  review  of  the  high-speed  capabilities  of  conventional  CCD 
output  circuits. 

A.  High-Speed  CCD  Output  Circuits 

Figure  A 5  illustrates  two  typical  circuits  used  to  accomplish  the 
charge/voltage  conversion  and  buffering  in  CCD  output  circuits  at  low  data 
rates.  Figure  A5(a)  shows  a  reset  amplifier  in  which  signal  charge  is 
transferred  to  an  output  diode  that  is  reset  once  per  clock  cycle  to  erase  the 
collected  charge.  Figure  A5(b)  shows  the  so-called  "floating  gate"  output 
circuit  in  which  charge  is  sensed  via  capacitive  coupling  to  the  surface 
potential  under  the  sense  electrode.  The  signal  charge  is  effectively 
erased  when  it  is  transferred  under  the  ?2  electrode  and  subsequently  to  a 
reverse-biased  diode  drain.  Since  there  is  no  dc  term  associated  with  the 
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charge  sensing  operation,  the  reset  transistor  need  only  be  operated  at  low 
duty  cycles  to  obviate  the  effects  of  leakage  current,  which  can  result  in  a 
slight  time  dependence  of  the  Vref  level  (shown  exaggerated  in  the  figure). 


Both  circuits  result  in  a  samp  1 e-and-ho I d  waveform  that  can  be 
approximated  by  the  rectangular  waveform  shown  in  Figure  46(a)  for  the  case  of 
a  dc  ("fat  zero")  input  signal.  The  period  of  the  waveform  is  T  =  l/fc  where 
fc  is  the  clock  frequency  and  the  duration  of  the  signal  interval  is  t,  which 
is  taken  to  be  3T/4  in  the  example.  Neglecting  the  Vref  offset,  the 
resulting  output  spectrum  is  shown  in  Figure  46(b)  and  consists  of  a 
distribution  of  6-functions  occurring  at  integer  multiples  of  the  clock 
frequency  fc  and  weighted  by  the  aperture  function  sin  (irfr  )/(nfr  ) . 


Figure  46(c)  shows  the  situation  when  signal  samples  g(nT)  are  added  in 
the  output.  The  continuous  function  g(t)  has  a  band-limited  spectrum  G(f) 
shown  in  Figure  46(d).  Neglecting  the  flat  zero  component  and  assuming  that 
the  output  charge  transfer  can  be  modelled  by  a  6 -function  current  source 
l0(t)  having  a  constant  Fourier  transform  1^,  the  analysis  in  Appendix  B 
shows  that  the  spectrum  of  the  output  voltage  Vout(f)  for  the  reset 
ampl  if ier  is  given  by 
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where  Cout  is  the  total  capacitance  of  the  output  node  and  the  infinite  sum 
is  indicative  of  aliasing  of  the  spectrum  G  ( f )  about  multiples  of  fc.  The 
resulting  output  spectrum  appears  in  Figure  46(e).  Note  that  in  the  ideal 
case  t  *T,and  the  sin  x/x  function  optimally  filters  the  clock  "noise"  due  to 
the  fat  zero  term. 


Potential  device  related  clock  rate  limitations  include  operation  of  the 
rest  switch  in  the  reset  amplifier  and  the  bandwidth  of  the  output  buffer  for 
both  configurations.  A  practical  limitation  of  the  floating  gate  approach  is 
due  to  capacitive  coupling  of  the  clock  waveforms  applied  to  the  electrodes 
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gure  46  Frequency  Response  of  the  Reset  Amplifier.  (a )  Output  waveform  and  (b ) 
spectrum  for  "fat-zero"  component.  (c)  Output  waveform  for  input  signal 
having  spectrum  shown  in  (d)  Resulting  output  spectrum  is  shown  in  (e) 
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adjacent  to  the  sense  electrode.  This  effect  can  be  reduced  by  maintaining 
the  adjacent  electrodes  at  appropriate  dc  levels  to  act  as  electrostatic 
shields.  This  results  in  an  unavoidable  increase  in  the  effective  gate  length 
of  the  sense  electrode,  causing  an  increase  in  the  transfer  time  constant, 
which  itself  limits  the  maximum  operating  rate.  For  these  reasons  the 
floating  gate  amplifier  is  seldom  used  where  high  clock  rates  are  desired,  and 
further  discussion  is  limited  to  the  reset  amplifier  configuration. 

Figure  47  shows  an  equivalent  circuit  of  the  reset  amplifier.  Charge 
transfer  is  modelled  by  the  current  source  ls  which  provides  current  pulses 
having  amplitude  -ls  and  duration  so  that  the  total  charge  transferred  is 
Qs  =  -Ists.  The  total  output  node  capacitance  is 

Cout  ■  cgsl  +  CD  +  Cgd2  +  cgs2  (|_A) 

where  Cgsi  =  gate-source  capacitance  of  Ml 

Cg  =  reverse  bias  capacitance  of  outut  diode 
Cgd2  =  gate-drain  capacitance  of  M2 
CgS2  =  gate-source  capacitance  of  M2 

A  =  voltage  gain^of  the  output  stage  (M2,  G^) 

The  transient  associated  with  the  reset  operation  is  determined  by  the 
drain-source  conductance  of  Ml  and  capacitance  Cout.  Assuming  that  the 
amplitude  of  the  reset  pulse  Vreset  is  larger  than  Vref,  then  Ml 
operates  in  the  linear  region,  since  the  peak  -train-source  voltage  corresponds 
to  the  CCD  output  signal,  which  is  typically  less  than  one  volt. 

The  drain-source  conductance  of  Ml  is 


Gq$  *  2  K  Vgso  W/L 


+  r  *  K  m 
*  , 


u*  /t  ~  lo"5  for  typical  devices  having  800  to  1000  ^  oxide 
ox  ox 

Vq so  '5  the  gate-source  voltage  during  the  reset,  and  W  and  L 


/3 


are  the  width  and  length  of  the  transistor  gate  region.  The  reset-time 
constant  is 


Preset  =  Cout/(2K  Vg$0  W/L) 

Typical  values  of  the  parameters  are  Cout  ~  0.25  pF,  Vqso  ~  2  V,  and  W/L 
~  3  giving  Treset  ~  2  ns  and  resulting  in  a  -3  dB  response  frequency  of 
about  80  MHz.  Since  a  low  duty  cycle  is  desirable  for  the  reset  operation, 
this  response  time  is  marginal  for  device  operation  in  excess  of  AO  MHz. 

The  value  of  -rreset  can  be  substantially  reduced  by  employing  a 
depletion  transistor  for  Ml,  thereby  increasing  Vqsq.  A  buried  channel 
device  exhibits  a  threshold  voltage  of  about  -6  V  at  the  bias  levels  employed 
in  the  output  circuit,  and  results  in  a  typical  Vgso  °F  ®  V.  Thus 
Treset  can  he  reduced  to  approximately  0.5  ns  which  is  sufficient  for 
operation  at  a  50  MHz  clock  rate  with  a  25%  reset  pulse  duty  cycle. 

The  function  of  the  source-follower  output  amplifier  is  to  provide  a 
buffer  between  the  low  capacitance  CCD  output  node  Cout,  and  the  output 
load  capacitance  Cl  due  to  package  parasitics  and  exernal  circuitry.  Typical 
values  of  Cl  are  5  to  10  pF,'and,  while  it  is  desirable  to  keep  Cout  as 
small  as  possible,  typical  values  range  from  0.1  to  0.25  pF.  Thus,  a 
worst-case  capacitance  ratio  of  100  must  be  taken  into  account  in  the  amplifier 
design,  while  maintaining  the  desired  bandwidth  and  power  dissipation 
parameters. 

The  capacitance  buffering  capabilities  of  a  single  M0S  source-follower 
stage  can  be  calculated  from  small  signal  device  parameters  at  the  established 
bias  levels.  The  following  analysis  is  based  on  typical  device  parameters  for 
0.3  (im  gate  lengths  fabricated  using  the  self-aligned  gate  NM0S  process. 
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The  input  capacitance  of  a  source-follower  stage  is 


Cjn  “  Cgd  +  CgS  (l-A) 

Cgd  is  the  gate-drain  capacitance  in  saturation 
Cgd  “  WCedge  +  WL  Cgate/2 

where  Cecjge  is  the  fixed  capacitance  due  to  the  small  gate-drain  overlap, 
Cgate  IS  taken  as  the  gate  oxide  capacitance,  and  L*  is  the  effective  gate 
length  that  takes  into  account  lateral  diffusion  of  the  drain  and  source  regions 
(l/  =  0.22  mils).  CgS  is  the  gate-source  capacitance  and  is  equal  to  Cg(j 
in  saturation  and  A  is  the  voltage  gain  of  the  source-follower  stage  given  by 


+  ju>C 


A  = 


m 


+  GL  + 


as. 
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Asuming  that  C9S  «  Cl,  the  -3  dB  cutoff  frequency  is  given  by 

“C  -  (gm  +  gl)/cl  . 

The  value  of  the  Gl  is  based  on  power  dissipation  requirements 

Gl  =  2  Pdmax/voo2  > 

where  PqhAX  is  the  maximum  desirable  power  dissipation  and  Vpp  is  the 
stage  power  supply  voltage.  It  is  assumed  that  Gl  will  be  furnished  off-chip 
so  that  PpHAX  's  taken  to  be  200  mW  with  Vpp  ■  15  V.  The  resulting 
value  of  Gl  is  1.78  mmhos. 


■  • 
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The  required  gm  is  then  dictated  by 


9m  “  '“cCL  “  GL 

and  is  related  to  device  parameters  by 

gm2  -  2KID  W/L 

where  K  *  ~  10“5.  The  gate  width  is  then 
W  -  gm2L/2KID 

*  gm2L/KVDDGL  , 

where  it  has  been  assumed  that  the  voltage  drops  across  Gl  and  the  transistor 
are  equal.  The  input  capacitance  is  then  calculable. 

Taking  C|_  *  10  pF  and  m  *  2nx  100  MHz,  the  required  gm  is  it. 5  mmhos 
and  A  ■  0.72.  The  resulting  gate  width  is  W  «  23*4  mils.  With  Ce(jge  * 
0.009  pF/mil  and  Cgate  »  0.22  pF/mii2  the  input  capacitance  is  Cjn  = 
1.15  pF,  which  is  too  large  to  be  applied  to  the  CCD  output  node.  Further 
decrease  in  the  input  capacitance  can  only  be  achieved  at  the  expense  of 
higher  power  dissipation  or  inclusion  of  a  second  source-follower  stage  to 
provide  additional  buffering. 

Repeating  the  design  process,  the  second  stage  power  dissipation  is 
ratioed  by  the  capacitance  reduction  of  the  output  stage: 

PD  “  pDMAX  Cin/CL  -  23  mW. 

It  is  assumed  that  the  same  Vgo  supply  Is  used,  thus  G|_2  “  204  mhos,  the 
required  <|n  1*  518  pmhos  and  A  ■  0.72.  The  gate  width  is  W2  ■  2.6  mils  and 
the  resulting  input  capacitance  is  C(n  -  0.16  pF,  which  is  a  reasonable 
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value  for  application  to  the  CCO  output  circuit.  The  overall  voltage  gain 
of  the  two-stage  amplifier  is  0.52  and  the  -3  dB  bandwidth  is  64  MHz. 

Although  the  paper  design  presented  above  is  not  optimized,  it  is 
indicative  of  the  performance  available  from  two-stage  NMOS  source-follower 
amplifiers.  The  primary  tradeoff  is  (as  usual  in  MOS  designs)  bandwidth 
versus  power  dissipation  for  a  fixed  load  capacitance.  Note  that  the  results 
are  independent  of  oxide  thickness,  since  that  parameter  affects  W  and 
Cgate  'n  a  complementary  manner.  However,  the  results  are  approximately 
proportional  to  the  square  of  the  minimum  attainable  gate  length  (L),  since 
that  parameter  affects  W  and  the  expressions  for  Cg5  and  Cgg  in  a  linear 
manner . 

It  is  concluded  that  conventional  CCD  output  circuits  can  be  employed 
at  clock  rates  near  50  MHz.  This  may  be  expanded  to  the  80  to  90  MHz  region 
with  shorter  gate  lengths.  The  on-chip  power  dissipation  is  high  and  may  be 
prohibitive  in  mult ichannel  circuits  requiring  several  independent  output 
amplifiers.  In  addition,  generation  of  the  short  duty  cycle  reset  pulse 
waveform  at  MOS  drive  levels  is  a  significant  problem  area. 

Figure  48  shows  a  schematic  diagram  of  a  high-speed  MOS  buffer 
amplifier  that  was  included  on  the  32  x  32  CTM  test  bar.  This  circuit  employs 
three  source-follower  stages  and  a  source-coupled  voltage  gain  stage  to 
compensate  for  the  low  gain  of  the  source-followers.  SPICE  circuit  simulation 
predicted  a  voltage  gain  of  1.3  (2.1  dB);  -3  dB  bandwidth  of  80  MHz  and  power 
dissipation  of  125  mW  when  driving  an  external  load  of  1  kfj  in  parallel  with 
10  pF. 

Figure  49  shows  the  predicted  and  measured  frequency  response  for  the 
high-speed  amplifier.  Although  the  measured  bandwidth  (50  MHz)  is  lower 
than  predicted,  the  circuit  exhibits  slightly  more  gain  (6  dB)  than  expected. 
Thus,  the  measured  gain-bandwidth  product  is  In  good  agreement  with  the 
predicted  value.  Further  analysis  Indicated  that  the  primary  sources  of  the 
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rcu i t  Schematic  of  the  High-Speed  Buffer  Amplifier 
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Figure  49  Measured  and  Predicted  Frequency  Response  of 
the  High  Speed  Output  Buffer 


unexpected  band! imiting  were  an  underestimation  of  the  device  capacitances  and 
excessive  load  impedance  in  the  gain  stage  due  to  unexplained  shifts  in  the 
threshold  voltage  of  the  buried  channel  load  transistors. 


B.  Bipolar  CCD  Output  Circuit 

Based  on  cons  i  derat  i  on  of  the  difficulties  associated  with  operating  a 
conventional  CCD  output  circuit  at  high  clock  rates,  an  earlier  output  scheme 
was  reconsidered.  Figure  50  shows  a  schematic  diagram  that  illustrates  the 
approach.  The  reset  switch  is  replaced  with  a  resistor  connected  to  the 
Vref  supply.  The  reset  operation  can  be  considered  to  be  continuous, 
rather  than  occur ing  at  a  specified  time  during  the  clock  cycle.  The  RCout 
time  constant  of  the  output  node  results  in  zeroth  order  integration. 

An  analysis  of  the  response  of  this  circuit  using  the  same  asumptions 
applied  previously  to  the  reset  amplifier  is  included  in  Appendix  B  and  shows 
that  the  spectrum  of  the  output  waveform  is  given  by 

RC  r-  "il 

Vout(f)*~f^  <!“  (j  +  <2  "fRCout)2J  nL  G(f  +  nfc) 


The  output  circuit  transfer  function  Hout(f)  •  Vout (f ) /G( f )  is  shown  in 
Figure  51  for  the  two  circuit  configurations  with  0  <  f  <-  fc/2.  t/T  is  used 
as  a  parameter  for  the  reset  amplifier,  while  RCout/T  i  j  the  parameter  for 
the  present  case.  Cout  is  assumed  to  be  the  same  for  both  cases. 

It  is  seen  that  the  response  for  RC/T  ■  0.25  Is  approximately  the  same  as 
for  t/T  ■  0.5  except  for  an  additional  8.5  dB  attenuation.  Similarly,  the 
response  for  RC/T  •  0.125  is  nearly  the  same  as  that  for  t/T  ■  0.25,  except 
for  an  additional  12  dB  attenuation.  Note,  however,  that  whereas  the  noise 
introduced  in  the  reset  amplifier  Is  effectively  sampled  end  thereby 
completely  aliased  into  the  Nyquist  interval  (0  <  f  <-  fc/J)  ,  the  noise 
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generated  in  the  RC  output  circuit  can  be  further  bandlimited  in  external 
circuitry,  leading  to  substantially  lower  noise. 

Thus,  if  Cout  is  reduced  by  a  factor  of  3  to  ,  output  circuit 
performance  of  the  circuit  very  closely  approaches  that  of  the  reset 
amplifier,  and  it  is  not  necessary  to  generate  the  high  speed  reset  pulse. 
This  is  the  rationale  that  led  to  the  application  of  the  bipolar  output  stage 
to  high  speed  devices,  since  the  effective  output  capacitance  can  be 
dramatically  reduced.  The  bipolar  device  is  integrated  in  the  CCD  output 
circuit.  The  structure  is  realized  by  adding  one  mask  and  two  extra  ion 
implantation  steps  near  the  end  of  the  CCD  process  to  fabricate  a  nondepleted 
n-base  region  and  a  self-aligned  p+  emitter.  This  combination  forms  a 
vertical  PNP  transistor  structure  with  substrate  serving  as  the  collector  and 
operates  as  an  emitter  follower.  The  emitter  can  be  biased  with  an  external 
resistor  to  provide  a  low  output  impedance  for  direct  connection  to  external 
circuitry.  The  dominant  output  node  capacitance  is  the  base-collector 
capacitance  of  the  bipolar  device,  since  the  load  capacitance  is  buffered  by 
the  3  of  the  device.  Figure  52  shows  an  equivalent  circuit  of  the  bipolar 
output  ampl if ier. 

When  a  charge  packet  flows  into  the  base  region  of  the  PNP  transistor,  it 
turns  on  and  draws  emitter  current  that  isgtimes  the  base  current  equivalent 
of  the  signal  charge.  This  current  gain  allows  a  much  larger  drive 
capability.  When  the  charge  packet  is  annihilated  by  recombination  in  the 
base,  the  PNP  transistor  switches  off  and  is  automatically  reset  for  the  next 
charge  packet.  Contrary  to  floating  node  schemes,  there  is  no  precharging 
necessary  and  no  clock  feedthrough  at  the  output  when  no  charge  packets  are 
sensed  in  this  mode  of  operation  of  the  bipolar  output  detector.  This  feature 
is  also  expected  to  result  in  reduced  fixed  pattern  noise  in  multichannel  CCD 
ci  rcuits. 
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C.  Bipolar  Output  Concept  And  Design  Consideration 

Figure  53  shows  the  cross  section  of  a  CCD  channel  that  incorporates  a 
bipolar  charge  detection  element  at  the  output.  This  detection  element  is 
shown  for  a  buried  channel  CCD  structure.  It  can,  in  principle,  be  used  with 
both  surface  and  buried  channel  CCDs.  The  two-phase  coplanar  electrode1®, 1 1 
CCD  shown  in  Figure  53  is  representative  of  the  CCD  structure  that  was  used  to 
demonstrate  the  concept.  The  concept  is  by  no  means  limited  to  this  CCD 
structure  and  may  be  used  with  other  two-,  three-,  or  four-phase  versions  as 
well. 

Figure  53  also  shows  an  integrated  load  device  and  source-follower  buffer 
stage,  which  allow  operation  of  the  circuit  in  one  of  several  modes  to  be 
discussed  later. 

The  bipolar  element  is  fabricated  at  the  end  of  the  CCD  process  by 
opening  a  window  in  the  polysilicon  output  gate  electrode.  The  n-type  base 
region  implant  is  performed  through  the  same  opening.  In  principle,  the 
lateral  straggle  of  the  implants ^  should  be  adequate  to  ensure  that  no 
emitter-collector  shorts  occur  at  the  edge  of  the  window.  To  guarantee  this, 
however,  a  low  temperature  heat  step  200oC)  may  be  used  to  flow  the  resist 
into  the  window  to  provide  additional  margin. 

The  detection  scheme  presented  here  is  designed  to  operate  at  very 
low  current  levels.  The  effective  base  current  into  this  device  may  be 
computed  from  the  dynamics  of  a  charge  packet  being  transferred  into  the  base 
region  through  the  output  gate.  It  has  been  shown, !3  that  the  charge  transfer 
in  a  CCD  is  approximately  exponential  and  the  electrode  potential  has  a  time 
constant  Tf,  which  may  be  described  by 
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Figure  53  The  Structure  and  Concept  of  the  Bipolar  Output  for  CCDs. 

The  top  figure  shows  the  cross  section  of  the  output  and  the 
bottom  shows  the  operation  of  this  output  scheme. 


where 


D  =  electron  dlffusivity 
n  =  electron  mobility 
L  =  length  of  the  gate  electrode 
Ev  =  lateral  fringing  field 

C)  =  a  paramef^ii/that  varies  from  l  to  2  as  the  normalized  fringing 
field  varies  from  0  to  °°. 

The  charge  Qg C t )  flowing  into  the  base  is  represented  as 

V‘>  ’  Vwn<'-'tAf>  •  (2) 

In  this  expression  Q0  is  the  charge  capacity  per  unit  area  and  Awe | |  is 
the  area  of  the  CCD  storage  well. 


The  base  current  in  the  bipolar  device  will  then  by  given  by 


&  /t/Tf 

B  dt  '  well  Tf 

This  exponentially  decaying  pulse  of  is  is  fed  Into  the  bipolar 
i s  normal ly  off. 


(3) 

device  which 


The  transient  response  due  to  this  current  may  be  estimated  from  the 
charge  control  analysis  of  the  bipolar  transistor.  The  charge  packet  will 
supply  the  base  recombination  and  charge  the  emitter  and  collector  junction 
capacitance  as  well  as  the  base.  The  differential  equation  describing  this 
process  may  be  written  as 


dt 


C  v. 
g  be 


-i. 


(4) 
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where 


08  base  minority  charge 

Q^e,  Q^c  =  junction  transition  charge 
Cg  =  output  gate  capacitance 

vke  =  base  emitter  vol.tage. 


Following  the  standard  charge  control  treatment^  Equation  (A)  may  be 
written  for  our  configuration  as 


(5) 


where  oug  ■  gm/Cg  for  the  output  gate  and  out  •  2Trfy  where  fy  is  the 
short-circuit  current  gain  bandwidth  product.  The  collector  resistance  is 
assumed  to  be  zero,  and  C|_Rl  represents  the  emitter  load. 


We  may  now  solve  Equation  (5)  and  (3)  to  obtain  the  emitter  current  as 


*e(t)  '  BQoAwell  [}f 


-t/kg 
TT 


{  (P  *  l)Tf  +  kgy 
(rf  +  k0) 


(6) 


where  k  «* 


It  is  obvious  from  (6)  that  the  emitter  current  provides  a  factor  of  0 
increase  in  the  charge  available  to  drive  the  next  stage.  Alternatively,  the 
load  capacitance  is  now  reflected  to  the  base  node  as  1/0  times  what  it  would 
be  in  a  simple  precharge  circuit.  Base  widths  of~l50()A  can  be  fabricated 
and  lead  to  an  out  of  much  greater  than  I  GHz.  So,  in  typical  layouts,  the 
total  effective  node  capacitance  of  this  output  device  is  dominated  by  the 
enlarged  output  gate  needed  to  enclose  the  base  structure.  Hence,  frequency 
response  of  this  output  structure  would  be  limited  by  the  output  gate. 
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Figure  54(a)  shows  a  theoretical  evaluation  of  the  emitter  current 
waveform  based  on  Equation  (6)  for  a  charge  packet  size  of  ~  1.6  fc.  The 
assumed  parameter  values  are  shown  in  the  figure.  The  simple  theory  presented 
here  is  compared  also  to  a  numerical  simulation  using  SPICE  circuit  simulator 
and  typical  MOS  parameters  Figure  54(b)  .  For  the  SPICE  simulation  we  have 
used  a  simplified  current  pulse  as  the  input  and  plotted  the  emitter  voltage 
as  the  output.  The  emitter  voltage  follows  the  emitter  current  prediction  of 
Figure  54(a)  based  on  the  charge  control  model.  It  has  been  assumed  in  this 
calculation  that  a  minimum  geometry  output  gate  node  can  be  built.  in 
parctice  the  output  gate  capacitance  is  larger  since  it  surounds  the  bipolar. 
This  will  limit  the  turn-off  characteristics  of  the  bipolar  device. 

This  application  of  the  bipolar  transistor  as  the  output  of  the  CCD  shift 
register  requires  that  the  0  fall-off  at  low  currents  be  minimized.  It  has 
been  very  well  known  that  the  low  current  0  fall-off  is  dominated  primarily  by 
injection  efficiency  considerations.  The  generation  component  of  current  in 
the  emitter-base  depletion  region  and  the  surface  recombination  component  play 
a  very  important  role  in  the  determination  of  the  low  current  fall-off 
Characteristics  of  the  bipolar  transistor.  The  incorporation  of  a  bipolar 
device  in  a  CCD  process  has  some  very  significant  advantages.  CCDs  are  built 
on  (100)  silicon  to  obtain  low  surface  state  density.  Gettering  sequences 
designed  into  the  process  allow  very  high  bulk  lifetimes  to  be  achieved  in 
processed  devices^*  16.  Bulk  lifetimes  of  ~  1  ms  and  surface  recombination 
velocities  of  <  I  cm/s  have  been  measured  in  our  devices.  In  addition  Hansel  I 
and  Fonstad!7  have  shown  that  the  6  linearity  at  low  current  is  affected  most 
significantly  by  decreasing  emitter  doping  to  take  advantage  of  the  increased 
mobility.  They  show  that  0  at  very  low  currents  is  obtained  at  the  expense  of 
the  absolute  6  value. 

In  an  exploratory  development  program  bipolar  structures  with  a  0  of ~800 
were  fabricated  as  well  as  those  wi th  0  ~ 100- 1 50.  The  variations  in  0  were 
achieved  by  varying  the  base  implant  dose.  The  typical  implant  distribution 
calculated  for  bipolar  devices  wi  th  0~1 00- 1 50  is  shown  in  Figure  55.  The 
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Figure  5^  (b)  SPICE  Simulation  of  the  Bipolar  Output.  The  response  to  a 

current  pulse  lg  is  shown.  This  simulation  is  more  optimistic 
than  the  charge  control  theory  because  default  parameters  for 
the  bipolar  are  used. 
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Figure  55  Doping  Distribution  Due  to  the  Base  and  Emitter  Ion  Implants 
Forming  the  PNP  Bipolar.  The  collector  distribution  is  non- 
uniform  because  of  the  channel  implants  required  to  avoid  short 
channel  effects  and  punch  through. 


estimated  fy  of  this  pnp  transistor  from  theoretical  considerations  is^  I 
GHz.  However,  the  frequency  response  of  the  bipolar  device  by  itself  cannot 
be  measured  since  it  is  integrated  with  an  FET  device  at  the  base.  The  3  vs 
lc  characteristics  of  these  devices  are  shown  in  Figure  56.  For  a  typical 
charge  packet  size  of  ~  4  x  10"15  c  and  a  gate  length  of  —  4  pm  the  effective 
base  current  pulse  is  about  26  nA.  Thus  for  3  ~  100  the  typical  operating 
region  is  a{  lc  '*'25  4A.  It  is  obvious  from  Figure  56  that  the  design  for  6 
*'100  has  adequate  low  current  0  linearity. 

The  bipolar  device  concept  and  design  discussed  in  the  previous  section 
may  be  operated  in  three  distinct  modes  at  the  output  of  a  CCD  shift  register. 
In  the  simplest  mode  the  bipolar  device  is  operated  with  a  fixed  emitter  bias 
obtained  through  the  emitter  load.  In  this  mode  the  charge  packet  turns  on 
the  bipolar  as  was  described  in  the  previous  section,  and  a  large  current  gain 
is  obtained.  However,  there  is  no  vo 1 tage  gain  available  because  the  emitter 
voltage  follows  the  base  voltage.  The  effective  charge  to  voltage  ratio,  of 
course,  depends  on  the  ratio  of  the  new  output  node  capacitance  to  the 
capacitance  of  a  storage  well.  The  use  of  the  bipolar  device  provides  an 
impedance  transformation  so  that  the  load  capacitance  on  the  emitter  node  is 
reflected  as  a  6  times  smaller  capacitance  at  the  base.  Thus,  for  a  given 
load  an  effective  increase  in  the  net  voltage  swing  may  be  obtained  for  a 
given  charge  packet  size.  This  increase  is  partly  offset,  however,  by  the 
increased  capacitance  components  due  to  base  emitter  capacitance  of  the 
bipolar  device.  The  operation  in  this  mode  has  the  advantage  of  very  low 
coupling  of  clocks  and  precharge  pulses  compared  to  a  normal  precharged 
output.  A  large  current  drive  capability  is  obtained,  but  the  voltage  swing 
is  not  significantly  increased. 


The  second  mode  of  operation  uses  a  dynamic  emitter  biasing  or 
precharging  of  the  emitter  node  before  the  charge  packet  flows  into  the  base. 
In  this  mode  the  emitter  is  floating  when  the  charge  flows  into  the  base  and 
the  transient  emitter  current  proportional  to  (0  +  I)  Is  produces  a  transient 
emitter  voltage  spike  that  may  be  much  larger  than  AV^e.  Thus,  a  large 


transient  voltage  swing  is  available  on  the  base,  which  may  be  used  as  a 
latching  signal  In  digital  applications. 


The  bipolar  output  may  also  be  operated  in  a  semi-nondestructive  mode. 
To  achieve  this,  the  device  is  built  into  one  of  the  elecrodes  of  the  shift 
register.  The  charge  packet  flows  into  the  base  and  remains  there  for  the 
clock  period.  During  this  time  the  bipolar  is  turned  on.  The  next  phase 
clocks  the  charge  out  of  the  base  into  the  next  storage  location.  Thus,  the 
signal  charge  may  be  propagated  after  the  read  operation  using  the  bipolar 
device.  The  signal  charge  used  to  provide  the  output  signal  corresponds  to 
the  fraction  of  the  charge  packet  which  recombines  in  the  base  during  the 
read  cycle.  Since  all  base  charge  does  not  have  to  recombine,  this  mode  of 
clocking  may  provide  an  extremely  high  speed  output. 

D.  Test  Bar  Design 

To  evaluate  the  capabilities  of  the  bipolar  output  device  in  high-speed 
analog  applications,  a  test  bar  was  designed  and  fabricated  that  included  the 
following  test  structures: 

•  2  0CCD  with  bipolar  output  (surface  and  buried  channel) 

•  2  0CCD  with  MOS  output  (surface  and  buried  channel) 

•  4  0C CD  with  bipolar  output 

•  4  0 CCD  with  MOS  output 

•  Single-stage  NMOS/PNP  buffer  amplifier 

•  Due  I -stage  NMOS/PNP  buffer  amplifier 

•  Bipolar  transistor  test  arrays  for  characterization  of  fj-  and 

•  MOS  transistor  array  located  in  proximity  to  the  bipolar  array  to 
test  for  threshold  voltage  variations  due  to  ohmic  drop  across  the  substrate 
caused  by  bipolar  collector  current 

•  Bipolar  test  array  for  measurement  of  Vgg  variations. 
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Schematic  diagrams  of  the  single  and  dual  stage  NMOS/PNP  buffers  are  shown  in 
Figure  57.  The  combined  use  of  enhancement  and  buried  channel  emitter  loads 
is  designed  to  provide  an  active  "pun-up"  via  the  enhancement  device, 
resulting  in  a  pseudocomplementary  output  stage. 

In  addition,  a  modified  version  of  the  32  x  32  CTM  is  included  that 
incorporates  the  bipolar  output  device.  The  CTM  geometry  was  reduced  by  80$ 
(linear)  to  evaluate  the  effect  of  smaller  gate  geometries. 

Figures  58  ,  59,  and  60  are  photomicrographs  of  the  completed  test  bar 
showing  the  location  of  the  important  test  structures. 

Initial  ion  implantation  parameter  for  the  base  and  emitter  implants  were 
extrapolated  from  those  used  in  the  previous  exploratory  development  effort. 
The  parameters  used  for  the  bipolar  test  bar  lot  were  distributed  across  the 
lot  and  bracketed  the  initial  values.  The  energy  and  dose  parameters  for  each 
slice  are  listed  in  Table  1. 
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a  .  j+J&*  -*‘r\4-**'*W1** 


Figure  58  Bipolar  Test  Device  Chip  From  Bipolar  Test  Bar 


From  Bipolar  Test  Bar 


Figure  60  32  x  32  CTM  With  Bipolar  Output 


Table  1 

Bipolar  Test  Bar  Implant  Parameters 


Slice  No. 


Base  Implant 


Emitter  Implant 


1 

1 

X 

10l3/cm2, 

3'p. 

350  keV 

2 

X 

14  2 

10  /cm  , 

B, 

50  keV 

2 

1 

X 

1013/cm2, 

3lp. 

350  keV 

3 

X 

.-14.  2 

10  /cm  , 

' 1 B , 

50  keV 

3 

1 

X 

I013/cm2, 

3'p. 

350  keV 

4 

X 

14  2 

io  V. 

MB, 

50  keV 

4 

2 

X 

l0*3/cm2 , 

3,p. 

350  keV 

2 

X 

14  2 

10  /cm  , 

"Bp 

50  keV 

5 

2 

X 

10,3/cm2, 

3V 

350  keV 

3 

X 

14  2 

10  /cm  , 

'V 

50  keV 

6 

2 

X 

I013/cm2, 

3V 

350  keV 

4 

X 

14  2 

10  /cm  , 

UB, 

50  keV 

7 

3 

X 

10,3/cm2, 

3IP. 

350  keV 

2 

X 

14  2 

10  /cm\ 

’’b, 

50  keV 

8 

3 

X 

I013/cm2, 

3,p. 

350  keV 

3 

X 

,J4/  2 

10  /cm  , 

HB, 

50  keV 

9 

3 

X 

lo'3/cm2. 

3,p. 

350  keV 

4 

X 

.J1*/  2 

10  /cm  , 

nB, 

50  keV 

10 

4 

X 

I013/cm2, 

31  p. 

350  keV 

2 

X 

14  2 

10  /cm\ 

1!B, 

50  keV 

11 

4 

X 

lo'3/cm2. 

3,p. 

350  keV 

3 

X 

,14 .  2 

10  /cm  , 

l,B, 

50  keV 

12 

4 

X 

10*3/cm2 , 

3,p. 

350  keV 

3 

X 

i _ 14  .  2 

10  /cm  , 

UB, 

50  keV 
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SECTION  VI  I 


BIPOLAR  TEST  BAR  EVALUATION 


A.  Bipolar  Device  Evaluation 

PNP  test  transistors  on  the  bipolar  device  test  chip  from  slices  2,  5,  8, 
and  II  were  used  for  the  initial  device  evaluation.  A  schematic  diagram 
showing  the  equivalent  circuit  for  these  test  devices  appears  in  Figure  61. 
The  gate  of  the  buried  channel  MOS  transistor  was  connected  to  the  emitter, 
thereby  ensuring  a  low  resistance  base  connection.  Dc  characteristics  were 
obtained  using  a  curve  tracer,  and  typical  results  are  shown  in  Figure  62. 
The  base-emitter  window  for  these  devices  had  dimensions  of  0.4  x  0.8  mils 
(10.2  pm  x  20.3  urn).  Betas  ranged  from  6  to  40  and  increased  with  reduced  base 
doping  density,  which  results  in  a  wider  base-collector  depletion  region  and  a 
narrower  base  width.  A  parasitic  collector  resistance  of  about  4  K  is  evident 
in  the  figure  and  is  due  to  the  high  resistivity  of  the  substrate  (~250-cm). 

A  limited  number  of  devices  have  been  tested  using  a  high  frequency 
network  analyzer.  The  results  of  these  tests  are  shown  in  Table  2.  The  dc 
betas  correspond  well  with  the  earlier  tests,  and  the  minimum  geometry  devices 
exhibit  cutoff  frequencies  in  excess  of  the  I  GHz  measurement  limit. 

B.  PNP/NMOS  Buffer  Evaluation 

The  slngle-and  dual-stage  output  buffer  circuits  shown  In  Figure  57 
were  included  as  test  structures  on  the  device  test  chip.  Only  the 
single-stage  circuit  has  been  evaluated.  Figure  63(a)  shows  the  pulse 
response  of  the  single-stage  buffer  driving  a  50  pF  load.  The  pull  up 
transient  is  due  to  the  NflOS  devices  and  exhibits  the  characteristic 
l-(l  +  tA  )"1  transient  with  t  ■  C|./gmo  '■>'18  ns,  where  g^o  is  the  device 
transconductance  at  t  »  0+.  Figure  63(b)  shows  the  pull  down  transient  due 
to  the  bipolar  device,  which  exhibits  a  fall  time  of  about  3  ns.  Since  this 
is  of  the  same  order  as  the  fall  time  of  the  of  the  Input  pulse,  it  is  not 
indicative  of  the  ultimate  performance  of  the  bipolar  device. 

103 


Slice  No. 


Table  2 

High  Frequency  Bipolar  Device  Test  Results 


Base/Emitter 

Geometry 


5 

0.4  x  0.4  mi  Is 

0.25  mA 

21 

>  1  GHz 

5 

2  x  2  mi  Is 

1  mA 

18.5 

300  MHz 

8 

0.4  x  0.4  ml  Is 

0.25  mA 

12.8 

>  1  GHz 

8 

2x2  mils 

1  mA 

12.4 

200  MHz 

106 


Input  Pulse 
5  V/div 

Buffer  Output 
5  V/div 


Horizontal  200  ns/d 


(a) 


Vertical 
2  V/div 

Hor izonta I 
5  ns/div 


(b) 


Pulse  Response  of  the  Single  Stage  PNP/NMOS  Buffer 
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Bipolar  CCD  Output  Circuit  Evaluation 

Evaluation  of  the  bipolar  device  In  CCD  output  circuit  applications  was 
accomplished  using  the  dual-channel  99-stage  two-phase  buried  channel  CCD  test 
structure  on  the  CCD  test  chip.  One  channel  of  this  device  employs  a 
conventional  reset  amplifier  in  the  output  circuit,  while  the  second  channel 
employs  a  bipolar  device  with  additional  peripheral  circuitry  to  allow 
operation  in  any  of  the  modes  described  in  the  previous  section. 

Each  CCD  channel  is  5  mils  (127  pm)  wide,  and  the  transfer  gate  length  is 
0.5  mils  (12.7pm).  The  emitter/base  window  for  the  bipolar  device  is  0.**  x 
0.1*  mils  (10.2  x  10.2pm)  and  is  placed  within  the  well  region  of  an  extended 
first-level  gate.  In  the  tests  described  below  the  bipolar  device  was 
utilized  as  an  emitter  follower  with  emitter  bias  established  with  an  external 
load  resistor. 

Figure  6k  shows  the  input  test  waveform  and  output  response  for  both  CCD 
channels  at  a  10  MHz  clock  rate.  The  external  load  resistor  for  the  bipolar 
device  was  470  0.  Note  that  the  output  traces  were  photographed  at  different 
sensitivities,  and  the  peak-to-peak  values  of  the  output  signals  are 
approximately  equal. 

Figure  65(a)  shows  an  expanded  trace  of  the  reset  amplifier  output 
waveform.  The  reset  pulse  was  about  5  ns  in  duration,  and  its  timing  relative 
to  the  output  clock  phase  resulted  in  a  duty  cycle  of  about  701.  Figure  65(b) 
shows  the  swept  frequency  response  of  the  delay  line,  which  exhibits  the 
expected  sin  x/x  spectrum.  The  response  above  10  MHz  is  degraded  by  direct 
feedthrough  of  the  signal  from  the  tracking  generator  due  to  the  experimental 
setup. 

The  transient  response  of  the  bipolar  output  circuit  can  be  obtained  from 
the  equivalent  circuit  shown  In  Figure  66.  The  transistor  is  modeled  with  a 
pi-equivalent  circuit  and  the  charge  transfer  with  the  current  source.  The 
differential  equations  describing  Vg(t)  and  Vg(t)  are 


108 


Input  Signal 
100  mV/div 


Input  Signal 
100  mV/div 

Output 
50  mV/d iv 


Horizontal  2  us/div 


(b) 


Figure  64  Response  of  the  Dual  Channel  CCD  Test  Structure  at  a  10  MHz  Clock 
Rate.  (a)  Reset  amplifier  and  (b)  Bipolar  output  with  external 
470  O  Load. 
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VB  + 


f(P  -  ')  +  J _  + _ Ll  j  +  _!s _ 

LreCL  \CL  reCBj  B  reRLCBCL 

=  pL±_Li  +  _Ll  +  j> 
CB  L  reCL  RLClJ  CB 


VE  "  VB  +  reCB  VB  -  'ore 


The  complementary  solution  of  (  7)  is  found  from 


VB  +  (a  +  b  +  c)  WB  +  be  VB  =  0 


where  a  =  (g  +  l)/r  C. 

e  L 

b  =  1/RlCl 
c  '  l/reCB 

Assuming  solutions  of  the  form  e"^*,  it  is  found  that  q  must  satisfy 


q2  -  (a  +  b  +  c)q  +  be  »  0 


for  which  solutions  are: 

q  =  (a  +  b  +  c)/2  +  [(a  +  b  +  c)  -  4bc]V2 

2  1 

q  =  (a  +  b  +  c)/2  -  [{a  +  b  +  c)  -  4bc]V2 


Further  assuming  b<"<a,c,  these  roots  lead  to  two  character  i  st  ic  time 
constants  given  approximately  by: 


Tl  ~  re(CB  +  CL/0) 

T2  ~  RLCL  +  (re  +  eRL)CB 


\1 


1 12 


where  t  j  characterizes  the  turn-on  transient  and  t2  characterizes  the 
recovery  transient. 


Figure  67(a)  shows  the  output  waveform  for  the  bipolar  output  with  a  470 
O  load  resistor.  The  observed  turn-on  transient  is  estimated  to  correspond  to 
a  time  constant  of  ti  ~  2 . 3  ns,  while  the  recovery  time  constant  is 
approximate  1  y  t  2  ~  27  ns  (t  /T  ~  0 .27 )  •  The  small  si  gna  1  em  itter  resistance 
can  be  estimated  from  re  *  kT/qlg  and  exhibits  an  initial  value  of  about 
200  0.  The  base  capacitance  Cg  is  estimated  from  geometrical  considerations 
to  be  about  0.4  pF.  Assuming  f)  =  10  and  C|_  ~  6.5  pF,  the  calculated  values 
of  the  time  constants  are  tj  ~  0.2  ns  and  r  2  ~  5  ns,  both  considerably 
smaller  than  observed. 

Note  the  presence  of  a  positive  "glitch"  on  the  falling  edge  of  the 
turn-on  transient.  This  transient  is  attributed  to  the  parasitic  collector 
resistance  that  causes  a  positive  excursion  of  the  substrate  (collector) 
voltage  during  the  initial  current  transient  and  modulates  lg  through  the 
Early  effect.  It  is  felt  that  this  glitch  masks  the  high-speed  turn-on 
transient. 

The  simplified  transient  analysis  presented  above  neglected  the 
dependence  of  the  small  signal  emitter  resistance  re  on  the  emitter  current, 
re  *  kT/ql£*  This  dependence  results  in  a  positive  feedback  effect  that 
reduces  the  base  current,  especially  when  low  values  of  Rl  «re  employed. 
This  mechanism  i$  probably  responsible  for  the  extended  recovery  transient. 

Figure  67(b)  shows  the  frequency  response  of  the  delay  line  with  the 
bipolar  output.  The  0  dB  reference  is  the  same  as  in  Figure  65(b),  and  the 
signal  energy  contained  in  the  first  Nyquist  Interval  for  the  bipolar  output 
is  comparable  with  that  for  the  reset  amplifier.  The  frequency  response  is 
down  by  about  -3  dB  at  the  Nyquist  rate,  corresponding  closely  to  the  r  »  T/4 
case  presented  in  Figure  51  of  the  previous  section. 
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I'  1 


jh\ 

f'  /  V 


Vertical 
50  mV/div 

Hor izonta I 
20  ns/div 


(a) 


Vert i ca I 
10  dB/div 

Hor i zonta 1 
2  MHz/d iv 


(b) 


Figure  67  Output  Waveform  and  Frequency  Response  Due  to  Bipolar  Output 
Stage  With  470  fi  Load.  (a)  Bipolar  output  waveform  with 
470  0  load.  (b)  Delay  line  frequency  response. 


1 14 


Figure  58  shows  the  output  waveform  and  frequency  response  for  the 
bipolar  output  with  a  1000  f)  load  resistor.  Note  that  the  low  frequency 
response  in  Figure  68(b)  is  about  9  dB  above  that  in  Figure  67(b),  whereas 
only  a  6  dB  increase  is  theoretically  expected.  This  improvement  is 
attributed  to  the  re  term,  which  increases  by  about  a  factor  of  2  in  this 
case.  The  response  at  the  Nyquist  rate  is  down  by  6  dB  and  corresponds  to r  ~ 
0.7  T.  Note  that  the  average  energy  in  the  first  Nyquist  interval  is  nearly 
10  dB  greater  than  observed  for  the  reset  amplifier  in  Figure  65(b). 

D.  32  x  32  CTM  With  Bipolar  Output 

As  in  the  initial  lot  of  32  x  32  CTHs,  the  bipolar  test  lot  exhibited  a 
high  incidence  of  intralevel  shorts  on  the  second  polysilicon  gate  level.  A 
number  of  32  x  32  CTM/bipolar  output  chips  were  prescreened  by  manual  probing 
and  were  packaged.  None  of  these  devices  were  found  to  be  operational. 

The  cause  of  the  intralevel  shorts  is  unexplained,  since  the 
"zero-undercut"  etch  technique  was  employed  in  processing  the  bipolar  lot. 
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Vertical 
50  mV/div 

Hor izonta I 
20  ns/div 


la) 


Vertical 
10  dB/div 

Hor izonta I 
2  MHz/d iv 


lb) 


Figure  68  Output  Waveform  and  Frequency  Response  Due  to  Bipolar  Output 
Stage  With  I  kfl  Load.  (a)  Bipolar  output  waveform  with  1  tf) 
load.  (b)  Delay  line  frequency  response. 
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SECTION  VIII 
CONCLUSIONS 


The  primariy  objective  of  this  program  was  the  development  of  an  analog 
CCD  reformatting  memory  for  high  speed  signal  processing  applications.  Based 
on  previous  experience  with  line-addressed  structures,  a  new  approach  was 
proposed  that  utilized  a  two-dimensional  charge  transfer  cell  in  a  square 
memory  array  with  integral  CCD  multiplexers  and  demultiplexers  to  provide  the 
interface.  In  principle  this  approach  offered  higher  performance  because  of 
its  potential  for  lower  f#ixed  pattern  noise  and  higher  operating  rates. 
However,  it  was  recognized  that  the  complexity  of  this  unique  charge-transfer 
device  resulted  in  a  moderately  high  technical  risk. 

During  the  course  of  this  program,  five  lots  of  devices  were  fabricated. 
The  first  lot  consisted  of  a  test  bar  that  included  a  32  x  32  element  CTM  and 
numerous  CCD  test  devices.  Although  half  of  the  lot  failed  due  to  an 
oversight  in  specifying  the  process  parameters,  operational  CTMs  were 
obtained,  and  their  performance  was  observed  to  be  nearly  as  anticipated. 
Fixed  pattern  noise  performance  across  an  entire  reformatted  frame  was  observed 
to  be  at  least  an  order  of  magnitude  better  than  in  line-addressed  designs. 
Because  of  a  layout  error,  it  was  not  possible  to  operate  these  devices  in  a 
100?  duty  cycle  mode. 

Two  photomask  levels  were  subsequently  modified,  and  two  additional  lots 
were  processed:  the  first  on  standard  two-inch  diameter  starting  material  and 
the  second  on  three-inch  material  known  to  have  more  uniform  oxygen 
distribution.  It  was  anticipated  that  the  devices  fabricated  using  the  latter 
would  exhibit  even  better  fixed  pattern  noise  performance  due  to  a  more 
uniform  impurity  distribution.  This  was  found  to  be  the  case,  and  100?  duty 
cycle  operation  was  observed  with  about  a  6  dB  improvement  in  fixed  pattern 
noise.  These  devices  were  subsequently  utilized  in  signal  processor  subsystem 
concept  demonstrations  including  a  radar  doppler  processor  and  two-dimensional 
discrete  Fourier  transform.  The  former  was  documented  in  the  interim  report 
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prepared  during  this  program;  the  latter  was  described  in  a  paper  at  the  1978 
CCD  Applications  Conferences  and  a  reprint  is  included  as  Appendix  C. 

The  CTM  structure  was  redesigned  with  minor  modifications  to  realize  a 
64  x  64  element  array.  This  device  was  also  processed  on  three-inch  material, 
and,  although  the  yield  was  low  due  to  a  high  incidence  of  inter  level  shorts, 
a  number  of  operational  units  were  obtained,  and  their  performance  documented. 
These  devices  have  been  operated  at  a  40  MHz  clock  rate  with  the  primary 
performance  limitation  due  to  parasitic  coupling  of  the  clock  pulses  to  the 
output  node.  Although  this  noise  component  can  be  removed  using  a  double 
sampling  approach,  this  solution  is  considered  undesirable  at  high  clock 
rates.  Further  study  should  be  directed  towards  on-chip  reduction  of  the 
parasitic  coupling  mechanism. 

The  fifth  lot  of  CTM  devices  fabricated  during  this  program  was  a  32  x  32 
test  device  that  was  included  on  the  bipolar  test  bar.  This  device  was  a 
"shrink"  version  of  the  previous  32  x  32  structure,  has  a  1 .6  mi  1  cell 
spacing,  and  included  a  bipolar  output  device.  Unfortunately,  the  bipolar 
test  bar  has  exhibited  a  high  incidence  of  intralevel  shorts  (similar  to  the 
problem  observed  in  the  first  lot  of  32  x  32  devices)  and  operational  units 
have  not  been  found. 

The  secondary  objective  of  this  program  was  the  development  of  a  high-speed 
device  for  use  in  CCD  output  circuits.  The  results  of  an  internal  exploratory 
development  program  led  to  the  decision  to  develop  a  CCD  process  compatible 
PNP  bipolar  device  for  this  application.  The  bandwidth  capabilities  of  this 
device  are  well  in  excess  of  those  attainable  with  NMOS  devices.  In  addition, 
a  particularly  efficient  output  circuit  configuration  results  that  is 
compatible  with  further  continuous-time  (as  opposed  to  sampled)  signal 
processing,  e.g.,  as  Is  the  case  for  surface  acoustic  wave  (SAW)  device  signal 
processing. 


A  bipolar  device  test  bar  was  designed  and  fabricated  and  preliminary 
evaluation  shows  that  the  transistors  exhibit  a  cutoff  frequency  in  excess  of 
I  GHz.  Analysis  of  the  operation  of  an  experimental  bipolar  CCD  output 
circuit  shows  the  frequency  response  of  the  output  circuit  to  be  in  good 
agreement  with  theoretical  predictions. 

In  summary,  the  results  of  this  program  have  been  excellent.  The 
two-dimensional  charge  transfer  CTM  has  been  successfully  demonstrated  and 
shown  to  be  a  powerful  addition  to  the  analog  signal  processing  repertoire. 
The  bipolar  output  technique  has  been  successfully  demonstrated,  although  a 
more  detailed  analysis  is  needed.  Further  efforts  should  be  directed  toward 
reducing  parasitic  coupling  of  the  CCD  clocks  to  the  output  node  and  more 
precise  identification  of  the  source  of  the  remaining  fixed  pattern  noise 
observed  in  the  CTMs. 
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APPENDIX  A 
CTH  TIMING  CIRCUITS 


Operation  of  the  two-dimensional  charge  transfer  CTM  with  1001  duty  cycle 
requires  the  generation  of  15  individual  clock  waveforms.  These  waveforms  are 
listed  below,  identifying  the  abbreviations  used  in  the  diagrams  that  appear 
in  this  appendix. 

#2»  Common  high-speed  multiplexer  and  demulti¬ 
plexer  clock  phases. 


03MUX: 

03DMXH*  03DMXV: 
IPG: 

PRE: 

0M2H: 

0M2V ' 

MUXh: 

MUXy : 


waveform  employed  in  the  multiplexer 

0^  waveform  employed  in  the  demultiplexer 

Demultiplexer  input  gate  (sampling) 
pulse 

Multiplexer  output  circuit  reset  pulse 

Common  memory  array  clock  phase 

Memory  array  clock  phase  controlling 
horizontal  transfers 

Memory  array  clock  phase  controlling 
vertical  transfers 

Horizontal  multiplexer:  array  transfer  pulse 
Vertical  multiplexer:  array  transfer  pulse 
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DHX^:  Horizontal  demultiplexer:  array  transfer 

pulse 

DMXy:  Vertical  demultiplexer:  array  transfer  pulse. 


A  block  d iag ram  showi ng  the  basic  organization  of  all  the  timing  circuits 
is  shown  in  Figure  69.  Four-phase  clock  waveforms  are  derived  from  a 
high-speed  master  clock  and  directly  applied  to  a  binary  counter  having  a 
modulus  equal  to  the  number  of  elements  per  field  (1024  for  the  32  x  32,  4096 
for  the  64  x  64).  The  carry  pulse  from  the  counter  toggles  a  flip-flop  whose 
output  state  determines  whether  array  charge  transfer  is  horizontal  or 
vertical.  This  flip-flop  is  disabled  in  the  SPS  mode.  The  remaining  clock 
waveforms  are  derived  from  the  four-phase,  counter  and  H/V  gating  pulses. 

Figure  70  shows  the  clock  generator  circuit  used  for  initial  evaluation 
of  the  32  x  32  CTM.  Schottky  TTL  was  employed  in  this  circuit,  and  the  part 
numbes  are  identified  in  Table  3.  TTL  to  MOS  level  translation  was 
accomplished  using  SN75363  integrated  driver  circuits  for  the  high-speed 
clocks  and  SN75361  drivers  for  the  memory  array  clocks. 

Figure  71  shows  the  clock  generator  circuit  that  eliminates  clock  pulse 
coupling  to  the  output  circuit  by  reducing  the  duty  cycle  to  251.  Thus,  only 
a  single  DMX^  transfer  pulse  and  MUXy  transfer  pulse  is  generated.  The  T2 
waveform  is  furnished  externally  and  allows  synchoronization  of  the  timing 
circuit  with  other  signal  processors.  A  parts  list  appears  in  Table  4. 


Figure  72  shows  the  high-speed  clock  generator  circuit  for  the  64  x  64 
CTM.  The  design  is  basically  the  same  as  shown  in  Figure  70,  except  that  the 
field  counter  is  expanded  and  ECL  logic  components  are  employed.  Figure  73 
illustrate  the  high-sped  pulse  generation  capabilities  of  this  circuit. 
Figure  73(a)  shows  and  <t>2  waveforms  that  are  derived  from  a  200  MHz  master 
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Table  3 

Parts  List  for  Figure  70 


Desiqnation 

Part  Number 

Al 

SN74S30 

A2 

SN74S 16 1 

A3 

SN74S161 

A4 

SN74S 16 1 

A5 

SN74S I 12 

A6 

SN74SU2 

A7 

SN74S112 

Bl 

SN74S04 

82 

SN74S74 

B3 

SN74SOO 

B4 

SN74S74 

B5 

SN74S10 

B6 

SN74S10 

B7 

SN74S74 

C 

SN75363  driver 
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Table  4 

Parts  List  for  Figure  71 


Designation 

A1 

A2 

A3 

A4 

A5 

A6 

A7 

A8 

A9 

AIO 

All 

AI2 

A13 

Al4 

A15 


Part  Humber 
SN74SH2 
SN74S112 
SH74SH2 
SN74S10 
SN74S74 
SN74S163 
SN74S163 
SN74S74 
SN74S00 
SN74S04 
SN74S02 
SN74S10 
SN74S74 
SH74S74 
SN74S32 
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clock.  Figure  73(b)  shows  the  030HXH  clock  waveform  and  the  DMXh  transfer 
pulse,  and  shows  how  the  03  waveform  is  Inhibited  during  the  parallel  transfer 
to  ensure  complete  charge  transfer. 

Figure  7k  shows  circuit  diagrams  of  the  clock  driver  circuits  employed  in 
operation  of  the  6k  x  64  CTM.  Clock  driver  circuit  1  was  used  to  buffer  the 

0j  »  0  2*  0j  03DMXH  ®3DMXV  and  \  clock  waveforms.  Clock  driver 
circuit  2  was  employed  for  the  remaining  high-speed  transfer  pulses,  the  input 

sampling  pulse,  and  output  circuit  reset  pulse.  Figure  75  shows  the  DMXh 
waveform  (driver  circuit  2)  and  the  03qhxh  waveform  (driver  circuit  I)  at 
a  25  MHz  clock  rate.  At  clock  rates  in  excess  of  40  MHz,  variations  in 
propagation  delays  through  the  ECL-TTL  convertors  result  in  improper  phasing 
of  the  clock  waveforms. 


Table  5  is  a  parts  list  for  the  high-speed  clock  generator  circuit. 


ECL  TO  TTL 


To  CTM 


2N3866 


2N5160 


Driver  Circuit  1 


2N3866 


10125 

ECL  TO  TTL 


ro  CTM 


2N3866 


Driver  Circuit  2 


3  4 
7  5 
11  12 

15  13 
14  10 


MC  10125 

ECL  to  TTL  Translator 


1 U— 

SN7536I 

TTL  to  MOS  Translator 


Figure  64  x  64  Clock  Driver  Circuits 


Table  5 

\ 

Parts  List  for  the  High  Speed  Clock  Generator 


Des i gnat  ion 
A) 

A2 

A3 

A4 

A5 

A6 

A7 

A8 

A9 

Bl 

B2 

B3 

B4 

B5 

B6 

87 

B8 


Part  Number 
MCI  0135 
MC10135 
HC 10135 
MC 10135 
MC 1 0 1 36 
HC 1 0 1 36 
HC 1 0 1 36 
MCI0109 
MCI  01 01 

MC 1 0 1 02 
MC10I06 
MC1013 1 
MC10104 
MC10104 
MC 1 0 1 3 1 
HC 1 0 1 02 
MC1013 1 
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APPPENDIX  B 

DERIVATION  OF  CCD  OUTPUT  CIRCUIT  TRANSFER  FUNCTION 


1 .  Reset  Amp I  If ier 

The  output  circuit  is  modeled  as  shown  In  Figure  76(a).  The  output 
charge  transfer  is  modeled  by  a  current  source  ls(t)  given  by 

a> 

',W  =  n L»  S(t  "  nT)  'o  (t  "  nT)  02) 

where  s(t)  Is  the  signal  waveform  and  lQ(t)  describes  the  current  transient 
due  to  charge  transfer.  The  reset  switch  is  closed  during  the  period  nT  +  Tg 
<  t  <  (n  +  1  )T.  Vout(t)  at  the  output  diode  is  to  be  calculated.  The 
effect  of  an  output  buffer  amplifier  is,  therefore,  ignored.  Vout(t)  ,s 
given  by 

t 

Vout(t)  “  Vref  '  C  J  ■«<»><*».  nT  <  t  <  nT  +  t  (13) 

o=nT  s 

*  vref»  "T  +  ts  <  t  <  (n  +  1)T 


The  spectrum  of  V  ^(t)  is  given  by  its  Fourier  transform: 


V  4»)  ®  I  V  .  (t)  e  JU)t  dt 

out^7  -*»  out 


nT+rs 


r  1  _  1  f 

"n=-»  t=nT  I  "ref  C  cfcnT 


|v„.  -  7J..T  lSW*l  e  J  <*« 


'] 


-j(Bt 


(n+l )T 

+  £  J  V _ *  e'J<wt  dt 


nT+r 


ref 


(14) 
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Now  apply  Poisson's  sum  formula^® 


S  0(nT)e-ju)nT  =  I  £  *L  +  *™\ 

n<“~* 0  T  n=-cc  I w  j  l 


(19) 


where  $(u>)  is  the  Fourier  transform  of  $(t).  Then  ( 1 8 )  becomes 


Where  S(ou)  is  the  Fourier  transform  of  s(t). 


2.  RC  Output  Circuit 

The  output  circuit  is  modeled  as  shown  in  Figure  76(b).  The  output 
spectrum  is  given  by: 


V 

ou 


t 


<n>) 


(21) 


|g(t)  has  the  form  used  previously. 

Thus, 

00 

!(«,)  =  Jr  S(t  -  nT)  I  (t  -  nT)  e'Jwt  dt 
s v  /  -oo  n=-«  v  o 

00 

=  r  S(nT)  J  l0  (t  -  nT)  e"ju,t  dt 
n=-® 

=  S  S(nT)  e~jwnT  I  fo)  •  (22) 

fl— oo  '  O 


11*0 


sson's  sum  formula,  (22)  can  be  expressed 


Again  using  lc(t)  -  lc  f, (t )  and  Poi 
as: 


ls  (u>) 


(23) 


Thus, 


Vout(u))  " 


'ref 


6(u)) 


(24) 


and  multiplying  numerator  and  denominator  by  C  yields 


/  (<b)  =  V  *  6(<o)  -  ~  -S 

outv ’  ref  '  T  C 


(25) 


which  allows  direct  comparison  with  the  spectrum  of  the  reset  amplifier  given 
in  (20). 
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APPENDIX  C 


A  CCD  TWO  DIMENSIONAL  TRANSFORM 


W.L.  Eversole,  D.J.  Mayer,  and  R..J.  Kansy 

Texas  Instruments  Incorporated 
Dallas,  Texas  75222 

ABSTRACT.  The  two-dimensional  transform  is  recognized  as  a  valuable  tool  for 
processing  two  dimensional  signals.  It  has  applications  in  bandwidth  re¬ 
duction  systems,  image  enhancement  systems,  correlation  trackers,  seismic 
array  processors,  and  radar  and  sonar  systems.  A  32  x  32  point  discrete 
Fourier  transform  has  been  built  using  two  low  power  CCD  integrated  circuits  - 
a  32  point  DFT  IC  and  a  32  x  32  point  reformatting  CCD  memory.  The  detailed 
design  and  operation  of  this  two  dimensional  transform  breadboard  are 
described . 


I.  INTRODUCTION 

Since  the  introduction  of  the 
charge-coupled  device  (CCD)  in  1970  , 
the  CCD  has  become  an  important  ele¬ 
ment  for  implementing  many  signal 
processing  functions,  one  of  which  is 
spectral  analysis  via  the  chirp  Z 
transform  (CZT)  algorithm.2-3  This 
algorithm  for  obtaining  the  discrete 
Fourier  transform  (DFT)  is  ideally 
suited  to  CCD  implementation  as  the 
bulk  of  the  computation  may  be  per¬ 
formed  in  CCD  transversal  filters. 

In  1977,  a  monolithic  32  point  CCD 
DFT  IC  which  implements  the  chirp  7, 
transform  algorithm  was  designed  and 
fabricated.  with  this  chip  a  32 
point  one-dimensional  DFT  may  be  ob¬ 
tained  in  64  us  with  a  power  dissi¬ 
pation  of  600  mw.  Although  the 
one-dimensional  DFT  has  many  applica¬ 
tions,  there  are  a  variety  of  appli¬ 
cations  which  require  the  processing 
of  two  dimensional  siqnals  such  as 
obtained  from  images,  seismic  arrays, 
and  sonar  and  radar  systems.  The  two- 
dimensional  DFT  may  be  realized  via 
the  one  dimensional  DFT  by  first 
transforming  the  rows  of  a  two  di¬ 
mensional  array  and  then  transforming 
the  columns  of  the  resulting  two 
dimensional  array.  To  implement 
this,  the  data  output  of  the  first 
transformation  must  be  reformatted 
to  become  the  input  data  for  the 
second  transformation.  In  1978  a  32x 
32  point  corner  turning  memory  ( CTM) 
using  a  novel  two  dimensional  charge 
transfer  structure  was  designed  and 
fabricated.  This  CCD  CTM  is  ideally 
suited  to  perform  the  reformatting 
opera t i on . 


This  paper  describes  the  imple¬ 
mentation  of  a  3?  x  32  point  two 
dimensional  discrete  Fourier  trans¬ 
form  using  the  32  point  CZT  IC  and  the 
32  x  32  point  corner  turning  memory. 
Section  II  reviews  how  the  two  dimen¬ 
sional  DFT  may  be  implemented  with 
one  dimensional  DFTs .  Sections  III 
and  IV  describe  in  detail  the  32 
point  CCD  CZT  IC  and  the  32  x  32  point 
CCD  corner  turning  memory.  Section  V 
presents  the  results  of  implementing 
the  two  dimensional  DFT  with  these 
two  CCD  ICs . 

II.  THE  TWO  DIMENSIONAL  DFT 

The  DFT  of  a  finite  area  sequence 
f(m,n)  is 

M-l  N-l  -  j2  it  (  —■*•—) 

F ( k ,  ?.)=;:  T.  f  (m,n)  e  M  N  (1) 

m=o  n=o 


The  two  dimensional  DFT  can  be 
interpreted  in  terms  of  the  one 
dimensional  DFT  by  expressing  Equa¬ 
tion  ( 1 )  as 


M-l 

F(k,n=  c,(m,u 
m=o 


where 


N-l 

G  ( m ,  i  )  -  '  f  ( m  ,  n ) 

n  =  o 


(2) 


(  3) 


The  function  G{m,t )  corresponds 
to  an  N-point  one  dimensional  DPT 
for  each  value  of  m,  i.e,  it  con¬ 
sists  of  M  one  dimensional  trans¬ 
forms,  one  for  each  row  of  f(m,n). 
The  two  dimensional  DFT  F(k,t)  is 
then  obtained  by  performing  N  one 
dimensional  transforms,  one  for 
each  column  of  the  sequence 
G(m,J).  Thus,  the  two  dimensional 
transform  can  be  implemented  as 
shown  in  Figure  1. 


OUTPUT 


low  weight,  low  power,  and  high 
speed  are  achieved  with  total 
integration. 

The  chirp  z  transform  algorithm 
is  derived  by  starting  with  the 
definition  of  the  discrete  Fourier 
trans  form: 


Fk 


N— 1 

r. 

n*o 


2ti  kn 
N 


and  making  the  substitution: 
2nk  =  n2  +  k2  -  (n-k)2 


(4) 


(5) 


The  chirp  Z  transform  equation 
results : 


,2  2  (l  2 
-jiUs  N-i  -vu  jn  yt-D) 

Fk«e  N  |  v  (fne  N  )e  N  !  (6) 
n=o 


Figure  1.  Implementation  of  Two 
Dimensional  Discrete  Fourier 
Transform. 

A  one  dimensional  transform  is  per¬ 
formed  on  the  rows  of  the  signal 
using  a  CCD  CZT  IC.  The  resulting 
complex  Fourier  coefficients  are 
stored  row  by  row  in  two  CCD  corner 
turning  memories.  Next  these  co¬ 
efficients  are  read  out  column  by 
column  into  a  second  CCD  CZT  IC 
which  completes  the  calculations 
for  the  two  dimensional  transform. 

III.  32  POINT  CCD  CZT  IC 

A  monolithic  32  point  DFT 
using  the  chirp  Z  transform 
algorithm  has  been  designed  and 
fabricated  using  an  N  channel, 
two  level  polysilicon  coplanar 
electrode  process.  Goals  of  the 
design  included  the  elimination 
of  all  external  support  components 
and  operation  of  the  transform  IC 
at  a  1  MHz  data  rate.  Small  size, 


Equation  6  has  been  factored  to 
emphasize  the  three  operations  which 
make  up  the  CZT  algorithm:  (1)  pre¬ 
multiplying  the  time  signal  with  a 
chirp  (linear  FM)  waveform,  (2) 
filtering  in  a  chirp  convolution 
filter,  and  (3)  postmultiplying  the 
Fourier  output  by  a  chirp  waveform. 
This  is  illustrated  in  Figure  2. 


f  chirp  impnhe  rnponw  '  N 


Down  Chirp  Dpwn  Chirp 


Figure  2.  Schematic  of  Chirp  Z 
Transform  Algorithm. 


The  chirp  Z  transform  IC  is  divided 
into  two  sections.  The  first  sec¬ 
tion  contains  the  multipliers  which 
are  implemented  using  multiplying 
dig i tal-to-analog  converters  ( MDACs ) . 
Analog  input  signals  applied  to  the 
MDAC  reference  terminals  are  multi¬ 
plied  by  binary  coded  chirp  waveforms 
stored  in  a  ROM.  External  digital 
inputs  are  provided  to  bypass  the  ROM 
for  mul t ipl icaton  by  other  waveforms. 
Differential  MDAC  analog  inputs,  and 
uncommitted  outputs  allow  maximum 
flexibility  in  total  system  configu¬ 
ration. 

The  second  portion  of  the  chip 
performs  the  chirp  filtering  oper¬ 
ation.  Four  63  stage  transversal 
filters  are  needed  to  implement  the 
complex  chirp  convoluton  required  by 
the  CZT  algorithm.  The  weighting 
coefficients  for  the  sine  and  cosine 
chirp  filters  are: 


dimensional  transforms,  or  for  per¬ 
forming  correlation  by  multiplication 
in  the  frequency  domain. 

A  block  diagram  of  the  CZT  chip 
is  shown  in  Figure  3,  and  a  photo¬ 
micrograph  of  the  IC  is  shown  in  Fig¬ 
ure  4.  The  IC  measures  6.04  x  5.69 
mm2  (238  x  224  mil2).  Details  re¬ 
lating  the  performance  of  the  32  poin 
CZT  IC  have  been  described  elsewhere. 


r1]  i 

i  j  •  '  ••  '••  •  ! 

I  |  ;  run  nviia 

I  1 .  f  |'B#I  S’ 
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nk 

* 
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32 

k  =  0 

62 
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.  s 

.k2 
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hk 

e 
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Fiqure  3.  Block 

Diagram  of  32  Po 

CCD  CZT  IC 

The 

coe 

ff  ic 

lents  are 

re  a  1  i  zed 

using 

the 

spl 

i  t-e 

■  ectrode 

technique . 

6  The 

CCD 

fil 

ters 

are  two 

phase , 

coplanar 

electrode  structures  with  ion  implant 
wells.  Operational  amplifiers  pro¬ 
vide  differential  inputs  and  outputs 
for  the  CCD  filters,  again  for  the 
purpose  of  versatility.  In  practice, 
the  filters  are  loaded  with  32  time 
domain  data  samples,  and  then  the 
input  is  blanked  while  the  conv- - 
lution  is  performed.  Thus,  a  sinqle 
chip  yields  a  DFT  output  with  a  50* 
duty  cycle.  In  this  mode  of  oper¬ 
ation,  the  4  MDAC ' s  provided  can  be 
multiplexed  to  do  both  pre-  and  post- 
mu  1  t  i  pi  ica  t  ion  .  If  100*  duty  cycle 
output  is  required,  two  chips  are 
used,  and  the  8  MDAC's  included  are 
sufficient  for  the  needed  multi- 
pi  ications . 

All  clock  waveforms  are  gene¬ 
rated  on  chip  from  a  two  phase  master 
clock,  and  the  system  timing  is  such 
that  it  is  possible  to  cascade 
CZT's  for  the  purpose  of  multi¬ 


r.rlvv  M 

1 1  ''  **  ’**  v-*' '...v . 

■W.Tir  I  «awi  ■  M t fitt kli]  IBB, 


iOD  ..ip.  it  ' 


. . 5 . 


Figure  4.  1’hotomicroqraph  of  32 
Point  CCD  CZT  IC. 


IV 


32  X  32  CCP  CTM 


Analog  CCD  reformatting  mem¬ 
ories  have  been  recently  applied  in 
high  speed  processors  for  pulse 
doppler  radar  in  conjunction  with 
surface  acoustic  wave  device  chirp 
transform  units.  I8-12'  These 
memories  are  predominantly  line- 
addressable  CCD  delay  line  struc¬ 
tures  which  exhibit  line-to-line 
offset  and  gain  variations  due  to 
fluctuations  in  MOS  transistor 
threshold  voltages  at  the  multiple 
input  and  output  ports.  Since  the 
doppler  information  is  typically 
used  on  a  line-by-line  basis, 
these  variations  can  be  tolerated 
to  some  extent.  However,  in  the 
application  described  here,  the 
information  content  of  an  entire 
frame  must  be  preserved,  and  the 
line-to-line  variations  constitute 
a  serious  dynamic  range  limitation. 

The  reformatting  or  "corner 
turning"  operation  can  be  visua¬ 
lized  as  loading  a  square  memory 
array  in  a  row-by-row  fashion, 
then  reading  it  column-by-column. 
The  most  direct  realization  in¬ 
volves  the  use  of  a  memory  struc¬ 
ture  in  which  charge  can  be  trans¬ 
ferred  either  horizontally  or 
vertically.'13'  This  realilzation 
is  further  enhanced  with  the 
addition  of  a  CCD  multiplexer  and 
demultiplexer  which  results  in  a 
single  input  port  and  output  port, 
thereby  minimizing  the  number  of 
voltage-charge  and  charge-voltage 
conversions,  and  their  trouble¬ 
some  threshold  dependence. 

A  block  diagram  illustrating 
the  operation  of  this  two  dimen¬ 
sional  CCD  memory  appears  in  Figure 
5.  The  input  demultiplexer  per¬ 
forms  a  serial  to  parallel  con¬ 
version  and  loads  the  memory  array 
row-by-row.  During  the  load  cycle 
charge  is  transferred  vertically 
in  the  memory  array.  When  all  rows 
have  been  loaded,  the  array  is 
switched  to  a  horizontal  transfer 
mode,  and  the  output  multiplexer 
performs  the  requisite  parallel 
to  serial  conversion  of  the  data 
stored  in  each  column. 


Charge  Transfer 
During  Read  in 

_ *  Charge  Transfer 

During  Readout 
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Input  . — 
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Figure  5.  Block  Diagram  of  the 
Corner  Turninq  Memory  (CTM)  * 

A  32  x  32  element  CCD  memory 
based  on  this  architecture  has  been 
designed  and  fabricated  using  the 
same  process  utilized  for  the  32 
point  CZT  IC.  A  photomicrograph 
of  the  chip  appears  in  Figure  6. 

In  order  to  permit  100%  duty  cycle 
in  other  applications,  an  additional 
multiplexer  and  demultiplexer  were 
included  at  the  bottom  and  right  of 
the  memory  array.  The  memory  array 
is  composed  of  1024  three-phase  two 
dimensional  charge  transfer  cells 
placed  on  2.0  mil  centers.  It 
operates  as  two  interleaved  two- 
phase  structures  with  one  common 
phase.  The  multiplexer  and  demul¬ 
tiplexer  are  four  phase  structures 
in  order  to  match  the  pitch  of  the 
memory  array  and  maintain  short 
transfer  lengths  for  qood  CTE. 

The  structure  is  buried  channel, 
with  the  exception  of  the  demulti¬ 
plexer  input  circuitry. 


1 1*5 


Fiqure  6.  Photomicrograph  of 
Corner  Turninq  Memory  (CTM) 

The  chip  has  been  success¬ 
fully  operated  at  data  rates  in 
excess  of  5  MHz.  Dynamic  ranqo 
is  still  dictated  by  threshold 
variations  in  the  multiplexer  and 
demultiplexer  channels  which 
result  in  "fixed  pattern"  noise 
components.  However,  nearly  40  dB 
dynamic  ranqe  (peak  siqnal  to  peak 
noise)  is  achieved  across  the 
entire  frame  as  compared  to  less 
than  20  dB  observed  in  a 
line  addressable  design.' 

V.  EVALUATION  OF  CCD  TWO  PI  MENS  IONA1. 

TRANSFORM 

Implementat ion 

A  detailed  block  diaqram  of  tie- 
CCD  two  dimensional  transform  is  shown 
in  Kiqure  7.  A  one  dimensional  DKT  is 
performed  on  the  rows  of  a  real  input 
siqnal,  f(m,n),  usinq  the  12  point  CZT 
IC.  The  real  and  imaq inary  Fourier 
coefficients,  C.  ^  ( m ,  , )  and  c,  j(m,t)  are 
reformatted  in  two  CCD  CTfts  resultinq 
in  transposed  matrices  r,_''(m,i)  and 
(■.|T(m,f).  A  second  one  dimensional 
transform  is  performed  on  the  trans¬ 
posed  data  from  the  CTMs .  The  final 
postmult iply  operation  has  been  re¬ 
placed  by  a  maqmtudmq  operation. 


i'fi  T v  1 OIJS 


All  timinq  pulses  needed  for  the 
CTM  operation  are  qenerated  externally 
usinq  standard  TTL  circuits  and  TTL- 
Mos  drivers.  This  control  circuitry 
also  provides  the  two  phase  clocks  and 
synchronization  pulses  needed  for  the 
CCD  07T  lCr.  Due  to  a  photomask  error, 
the  tapweiqhts  of  the  CCD  filters  on 
the  CZT  IC  wore  shifted  one  stage 
thereby  shiftinq  the  output  sequence 
of  the  Fourier  coefficients  by  one 
position.  This  also  precludes  use  of 
the  on  chip  MPACs  for  postmultipli¬ 
cation  operations.'4'  There  fori'  the 
MPACs  of  another  CZT  IC  using  a 
delayed  sync  pulse  were  used  to  per¬ 
form  the  postmu  1 1  ipl  icat. ion  needed  in 
the  first  one  dimensional  transform. 

In  order  to  facilitate  inter¬ 
facing  the  analoq  circuitry  and  to 
simplify  system  timing  and  synch¬ 
ronization,  evaluation  of  this 
demonstration  unit  was  performed 
at  a  100  kHz.  data  rate. 

Kxper l mental  Resu 1 ts 

A  simple  two  dimensional  input 
test  signal  was  qenerated  by  multi¬ 
plying  two  analog  signals  together 
in  a  4  quadrant  analog  multiplier. 

One  signal  provides  variations 
along  the  horizontal  axis  and  cor¬ 
responds  to  the  rows  of  f(m,n).  The 
second  siqnal  is  sampled  once  per 
row  and  amplitude  modulates  the  first 
signal  to  provide  variations  alonq 
the  vertical  axis  and  thus  alonq  the 
columns  ol  f(m,nl.  An  example  of 
the  two  dimensional  input  siqnal  is 
shown  in  Figure  R  for  sinusoidal 
signals.  The  signals  are  synch¬ 
ronized  by  the  master  timinq  to  pro¬ 
vide  phase  lock  for  the  photographs. 
Figure  Ha  shows  the  input  siqnal 
for  a  complete  f  rami',  i.e.,  32  rows 

and  12  columns.  Figure  Rh  is  an  ex¬ 
panded  view  of  the  input  for  one  row. 
In  addition,  a  lx  offset  was  added 
to  the  two  dimensional  fest  siqnal 
in  order  to  mark  I  lie  origin  F(  1  ,  1  1 
of  the  two  dimensional  transform. 
Figures  •»  through  1?  illustrate  the 
operation  of  the  two  dimensional 
t  tan;. form  with  I  h"  input  siqnal  of 
Figure  B  . 


I  U6 


The  real  and  imaginary  Fourier 
coefficients,  GR(m, t)  and  Gj(m,C) 
from  the  first  one  dimensional  trans¬ 
form  are  shown  in  Figure  9  for  both 
the  entire  frame  and  for  one  row. 

The  output  of  this  one  dimensional 
transform  for  one  frame  consists  of 
32  groups  of  32  Fourier  coefficients, 
one  group  for  each  row  of  the  input 
signal  f(m,n).  The  Fourier  coeffi¬ 
cients  in  each  group  appear  in  the 
order  32,  1,  2,  3,  ...,  31  due  to 
the  photomask  error  previously 
mentioned.  The  horizontal  input 
signal  consists  of  six  cycles  of  a 
sinusoid.  Therefore  the  transform 
of  a  row  results  in  non  zero  values 
for  the  7th  and  27th  Fourier 
coefficients  as  shown  in  Figure  9b. 
The  real  and  imaginary  outputs  are 
90*  out  of  phase,  thus  one  is  at  its 
peak  value  when  the  other  is  zero. 
Also  seen  in  Figure  9b  is  a  non  zero 
value  for  the  first  Fourier  coeffi¬ 
cient  of  the  real  output  due  to  the 
»C  offset  of  the  input  signal.  Note 
that  with  the  exception  of  the  DC 
coefficient  the  non  zero  Fourier  co¬ 
efficients  are  modulated  by  the  ver¬ 
tical  signal  as  seen  in  Figure  9a. 

Figure  10  illustrates  the  output 
of  the  corner  turning  memory  GMm,t). 
The  corner  turning  operation  trans¬ 
poses  the  coefficients  previously 
stored,  G(m,£),  thus  each  consecutive 
group  of  32  points  from  the  CTM  con¬ 
sists  of  the  data  in  one  column  of 
G(m,(J,  i.e.,  the  32  values  of  a 
single  Fourier  coefficient  previously 
obtained  by  transforming  the  rows  of 
f(m,n).  This  matrix  transpose  oper¬ 
ation  is  confirmed  by  comparing  the 
CTM  outputs  in  Figure  10  to  the  CTM 
inputs  in  Figure  9.  Figure  10a  shows 
a  complete  frame  of  transposed  data. 
Figure  10b  is  an  expanded  view  of  the 
8th  group  of  32  points  from  the  CTM. 
This  corresponds  to  the  32  values  of 
the  7th  Fourier  coefficient  which  is 
the  8th  column  of  G(m,f). 

The  final  output  of  the  two 
dimensional  transform  unit  is  the 
magnitude  I F ( k ,  ' ) I  which  is  seen  in 
Figure  11.  This  is  obtained  by 
generating  the  one  dimensional  DKT 
of  each  of  the  32  groups  of  32  points 
from  the  CTM.  This  corresponds  to 
obtaining  the  transform  of  each 


column  of  the  matrix  G(m,{).  From 
Figure  10a  it  can  be  seen  that  only 
the  2nd,  8th,  and  28th  groups  from 
the  CTM  contain  non  zero  coefficient 
These  are  due  to  the  input  DC  offset 
and  the  horizontal  sinusoid.  Thus 
the  second  DFT  yields  non  zero  co¬ 
efficients  only  in  the  1st,  7th,  and 
27th  rows  of  F(k,£).  The  second 
group  from  the  CTM  represents  the 
unmodulated  DC  offset.  The  DFT 
of  this  group  yields  only  a  non 
zero  DC  coefficient  F(l,l).  The 
The  8th  and  28th  groups  have  been 
modulated  by  the  vertical  input 
signal  which  consists  of  two  cycles 
of  a  sinusoid.  The  DFT  of  each  of 
these  groups  yields  non  zero  values 
for  the  3rd  and  31st  Fourier  coeffi¬ 
cients  in  rows  7  and  27  of  F(k,t), 
i.e.,  F ( 7 , 3 )  ,  F( 7 , 3 1 ) ,  F(27,3),  and 
F(27,31).  Figure  lib  is  an  expanded 
view  of  the  DFT  of  the  8th  group 
from  the  CTM  showing  IF(7,3)|  and 
I F ( 7 , 31 )  I  . 

In  order  to  conveniently  demon¬ 
strate  the  operation  of  the  two 
dimensional  transform,  a  display 
circuit  was  employed  to  apply  the 
desired  signal  to  the  Z  axis  of  an 
oscilloscope.  The  X  axis  sweep  was 
triggered  by  a  row  synchronization 
pulse  and  a  frame  synchronized  stair¬ 
case  waveform  was  applied  to  the  Y 
axis.  A  short  pulse  added  to 
IF(k,£)|  before  each  coefficient 
output  displays  a  two  dimensional 
grid.  Fach  value  of  IF(k,f)l  appears 
as  a  line  after  its  corresponding 
grid  point.  Figure  12  shows  the  dis¬ 
play  of  the  input  signal  used  in 
Figures  8  through  11  and  its  two 
dimensional  transform.  The  input 
signal  is  the  product  of  six  cycles 
of  a  sinusoid  in  the  horizontal 
direction  and  two  cycles  of  a  sinu¬ 
soid  in  the  vertical  direction  with 
a  DC  offset  added  to  the  product. 

The  display  of  the  two  dimensional 
transform  magnitude  shows  the  IK 
coefficient  F(l,l)  in  the  upper  left 
corner  at  position  (2,21  of  the  32  x 
32  array.  The  Fourier  coefficient 
Ft  7, 3)  due  to  the  input  product  can 
be  seen  at  position  (8,4)  of  the 
array.  (Remember  that  all  outputs 
are  shifted  by  one  bit  in  both 
directions  due  to  the  CZT  photomask 
error.)  The  three  aliased  outputs 


F( 7 , 31 ) ,  F(  27 , 3 ) ,  and  F(27,31)  can 
also  be  seen. 


(a)  Sinusoidal 

Inputs  f(m,n) 


(b)  Output 
I F  (  k  ,  e. )  I 


Figure  12.  Two  Dimensional  Display  of 
the  DFT  Output  for  Sinusoidal  Inputs. 


Figure  13  shows  the  operation  of 
the  two  dimensional  DFT  with  a  square 
wave  product  input.  The  horizontal 
input  seen  in  Figure  13a  is  two  cycles 
of  a  square  wave  and  the  vertical 
signal  is  one  cycle.  A  DC  offset  has 
been  added.  The  DFT  output  seen  in 
Figure  13b  clearly  shows  the  funda¬ 
mental  and  odd  harmonic  components 
due  to  each  input.  The  aliased  funda¬ 
mental  from  the  vertical  square  wave 
appears  in  the  first  column  of  the 
display  due  to  the  C7.T  photomask  error. 
The  DC  components  from  the  2nd  CZT 
seen  in  column  two  of  the  display  are 
due  to  the  -36  dR  fixed  pattern  noise 
in  the  CTM.  Figure  14  shows  lF(k,K)l 
for  k=3,  i.e,  the  4th  row  of  the 
display.  An  imbalance  in  the  output 
maqnituding  circuit  used  causes  the 
nonuniformity  seen  in  the  harmonics. 


(a)  Square  Wave  (b)  Output 

Inputs  f (m ,n )  I F( k ,  f  )  I 


Figure  13.  Two  Dimensional  Display  of 
the  DFT  Output  for  Square  Wave  Inputs. 


Figure  14.  One  Row  of  the  DFT  Output 

The  dynamic  range  of  the  two 
dimensional  transform  is  presently 
limited  by  the  corner  turning  memory 
which  exhibits  36  dB  dynamic  range 
due  to  a  fixed  pattern  noise  component 
discussed  earlier.  The  dynamic  range 
of  the  CCD  CZT  IC  has  been  demon¬ 
strated  to  be  50  dB.  The  power  re¬ 
quired  for  the  complete  two  dimen¬ 
sional  transform  breadboard  was  10 
watts.  This  includes  all  timing  cir¬ 
cuitry,  external  amplifiers,  drivers, 
MDACs,  etc.  Improvements  to  the  CZT 
and  CTM  IC  and  further  integration 
can  be  expected  to  reduce  the  power 
requirements  and  increase  dynamic 
range . 


CONCLUSIONS 

Preliminary  results  from  a  two 
dimensional  DFT  breadboard  demon¬ 
strated  the  potential  in  applying 
analog  CCD  technology  to  the  imple¬ 
mentation  of  low  power,  high  speed 
complex  two  dimensional  signal  pro¬ 
cessing  algorithms.  Although  100  kHz 
is  not  an  aggressive  data  rate  for 
CCD  circuits,  it  results  in  computa¬ 
tion  of  the  32  x  32  point  two  dimen¬ 
sional  DFT  (1024  total  points)  in 
0.4  msec.  A  digital  FFT  processor 
would  require  an  effective  multiply- 
accumulate  time  of  80  nsec  to  achieve 
the  same  rate.  It  is  anticipated 
that  CCD  two  dimensional  transforms 
can  be  performed  at  clock  rates  of 
several  megahertz  with  a  moderate 
increase  in  system  complexity. 
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